CTTGGTTTGA TTTTAGGCAA GGTAATGGTT AA7AACCCAT TTTCAAAACT AGCAGTAATA 1920

TGTTGCTTAT CAACAGCTTC AAAATCAAAT TGACGCATTA ATGATTCGAA GTTACGCTCA 1980

TCTAAAATGA GTTGTTCAGA TTTGTATTTT GCGCTTCTAG TAGCTTGAAT AGTGAGCGwA 2040

TTAttATTGA AATCgATACT AATAtCTccC TG 2072
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

CGTCTTCTCT TGGTTATTCT CTGTGTTTTG TCTTTGTTCA AATTCGATTT TTTGTTGTTC 60

GAATGAATGT AATACTTTAT TTTTTTTTGG CACATAATCC AAGTnATTTT TAGGAATTAA 120

TATACGGTCT TTAAATGCTT CTTGTATTTT GCTCACAATC AATTGGTATA GTTGCTCTTC 180

TTTTGATAAA CGCACTTCTA GTTTTGTTGG ATGAACATTT ACGTCTACTA AGATTGGATC 240

CATTTCAATA TTAATATAAC AAATCGGGAA CCTACCTATT GTTAAGAGTG TATGATAGCC 300

TTCTAAAATC GCTTTATTTA GCATAAAGTT TTTAATGTAT CGTCCATTAA TAAAAATAGA 360

AATATAGTGC TTATTACTTC TAGAATGTTC AGGCTTTGCA ACAAAACCTT CAATGTGATA 420

ATCACTTGTA TCTCCAGATA TATGTACTAA ATCTCGTGCA ACTTTCATCC CATAAATCTC 480

TGCCATCACT TCATTAGTTC GTCCTGAACC ATTTGTACTT AACATTGTTT TGCCATCTGA 540

AATGAGTGCT ATTCGAATGT CCGGATGGCT CATTGCCATT CTGTTGACAA TATCTGTTAT 600

TTTACCTAGT TCAGTGTATA AACTTTTAAT ATATTTTAAA CGTGCTGGTG TATTATAAAA 660

TAATGATTCT ACAAGTATAT CTGTTCCTTT TTTCGCTTTT GCAGGCTTAT GATTTAATAT 720

TTCACCATTT TCTACATATA TTTCATTTCC ATTAGCATTA TCCGTGCAAG TCTTCAATGT 780

TACTTTAGCA ACTGATGAAA TACTGGCTAG TGCTTCACCA CGGAATCCTA ATGTCCTAAT 840

ATGAAATAAA TCTTCATCTT GATCTAATTT ACTAGTCGCA TGTCTATGAA ATACTAATCC 900

TAAGTCTTCC GCTTCAATTC CGCTTCCATT ATCGACTACG CGAATAGATT GGACGCCAGA 960

tTCCTCTACT TCAATGCTTA TTTCTGTAGC GCCTGCATCT ATAGCATTTT CCAACAGTTC 1020

TTTCACAACA GAACTCGGTC TTTCAACTAC TTCACCTGCT GCGATTTTAT TTGCTAATGA 1080

GGTTTGGAGT TCTTTAATTT TCCCCATTTT GCAACACCTC TATTTTAATT GATTTTGTAA 1140
TTGTAGTTCA ATCTCGCTTT TTTGATCATT TTCAAACAAA TcAAATGATG CyTGTTCAAA 1260

GTCTTTTTGA GATAAAGTAT CaGTTGTTTC TTCaACACTT aAGTTTAAAT TTTCTTGATT 1320

AATTTCCAGG TTCATTTTCG ACCATTTTTA AATTTGATAT CGATGATtTT TCACCAGCAG 1380

ACGCTTCAAA CTCGCTTAGA ATCACTTGTG CTCTGCTAAT AACTTTTTCA GGTAAATCAG 1440

CTAATTTCGC AACTTGAATA CCATAAtATC GTCAACTGCA CCATCTTTGA CTTTATGCAA 1500

GAATATAAGT TCACCTTTAT ATTCATTAGC AGCGACGTGA ACATTTTTTA GACTTGGTAA 1560

TGCTTGATCT AATGTTGTCA ATTCATGATA ATGTGTTGAA AATAACGTTT TAGCATGTGA 1620

TGTTTCAGCT ACATACTCTA TCATTGCCTG CGCTAAAGCT AAACCGTCAT ACGTTGAAGT 1680

ACCACGTCCA ATTTCATCGA AAATAATCAA ACTATCCTCT GTTGCATAAG TTAATGCCTT 1740

TTGTGCTTCT AGCATTTCTA CCATAAACGT ACTCTTACCT GAAACCAAAT CATCTGCCGC 1800

ACCTATTCTA GTGAATATTT GATCAAATAT AGGTAACACT GCCTCTTTAC AAGGGACATA 1860

AGCTCCCATT TGGGCCATTA TACTAATTAT GGCAACTTGT CTCATATATG TCGATTTACC 1920

AGACATATTC GGACCTGTAA TTAAATATAT AAATGTTTCA TTATCTAATC GACAATTATT 1980

AGGCACATAG TCATTATAAT CCATTACTCT TTCCACTACT GGGTGCCTAG ATTCCACTAA 2040

TTCTAATGTT TTATTTTCAC TAAATGAAGG CCTAGTGTAA TTATATTTTT GAGCAATTTC 2100

TGCAAAGCTC TGTAAACAAT CTAGCTCTGA AATAATTTTA GCTTGTTGTT GTAAACGTTC 2160

AGTATATTTT TTAACTTCTT CACGTAGCTG AACAAATAAT TGATATTCTA ATTCGATGGC 2220

TTTGTCTTCC GCACCTAAAA TGATATCTTC TTTTTCTTTA AGTTCATCAG TTATAAAACG 2280

TTCAGCATTC GATAACGTTT GCTTCCTCAT ATAACCAAAT TCACTTGGTT CAAAATTTTG 2340

CAAGTTGGCA CGTGTTATTT CTATAAAATA ACCAAACACT TTATTAAAGC TTATTTTCAA 2400

TGATTTTATT CCTGTACGTT GTCTTTCTTT GGCTTGTAAT TCTGCTAACC ATGTTTTTCC 2460

GTTTTTTGAA GCTTCAAGAT ATTCATCTAA TTGCGTATTA AAACCAACTT TGAATAGTCC 2520

GCCATCTTTA ACTGAAATTG GTGGTTCTTC TACTAAACTC TGTTCTAATA TATCAAGTAA 2580

ATCATCaAGG GGTTCTAGTT GATTAACTTG TACAAGAGTA TTCTGATTCA TAGAATTTAG 2640

TAATGCTTTA ATATTCGGTA TTTCAGAAAT GGAATGTTTA AGTTGAATTA AATCTCTCGC 2700

ATTAACATTT CCGTAACTAA CACGCCCAAC AAGACGTTCA ATATCATACA CTTGATTAAG 2760

ATATGTTCTT AAGGTGTCTC TTTCTATGAA ATGAGCACTA AATTCATCAA CGATATCTAA 2820

TCGTGCTTCA ATTTGTTCTT TACTTATTAG TGGTCTATCT ATCCATTGTT TTAAGCGGCG 2880

TGCTCCCATT GGTGTTTTCG TTTCGTCCAT TAGCCAAAGT AGCGTTCCTT TTTTTGATTT 2940
ATCTATAGCT GCATATTGAA CAACATCCTC GATATGCGAT AAATCACGTT TTTGTGTATG 3060

ATGAATATAA TCTAGCAATA ATTGTGTCGC TTGATACaTT AATTTATGTT CAGTTTGATT 3120

CACACTATAG ATTtCTGATG ATAACGTTTC CCTGACTGT 3159

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1238 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

TAAGCGAGAA GCAATTGGTG TTATGTATGC TAGTGATAAA CCAACAGGTG AAAGTACAAG 60

GTCATTTGCT GTTTATTTCT CTCCTGAAAT TAAGAAATTT ATTGCAGATA ATTTAGATAA 120

ATAAATCATC CATCCATACA TTGATAAATG ATTTTyAGAA ATTAACAACA AAATCAACAA 180

TTTTAAACAT CTCTGTGATT CTATTTATTC GAAATGATTT AAAAAATAAA ACTTCAAAAA 240

CCTAACCTTA TATTTATACG AATACTTAGA GGAGCACAAA AATGAATAAA AATATAATCA 300

TCAAAAGTAT TGCAGCATTG ACGATTTTAA CATCAGTGAC TGGCGTCGGC ACAACAGTGG 360

TTGAGGGTAT TCAACAAACG GCTAAAGCTG AACATAATGT GAAACTAATC AAAAATACTA 420

ATGTAGCACC ATACAATGGT GTCGTTTCGA TAGGATCTGG AACAGGTTTC ATTGTCGGTA 480

AAAATACAAT TGTTACCAAC AAGCATGTCG TTGCAGGTAT GGAAATTGGT GCACATATTA 540

TAGCGCATCC CAATGGTGAA TATAATAATG GCGGATTTTA TAAAGTTAAA AAAATTGTCC 600

GTTATTCAGG TCAAGAAGAT ATTGCCATTC TACATGTGGA AGATAAAGCT GTTCATCCAA 660

AAAACAGGAA TTTTAAAGAT TACACAGGCA TTTTAAAAAT AGCATCAGAA GCTAAAGAAA 720

ATGAACGCAT TTCAATTGTT GGCTATCCAG AACCATATAT AAATAAATTT CAAATGTATG 780

AGTCAACAGG AAAAGTGCTG TCAGTTAAAG GCAACATGAT TATTACTGAT GCTTTCGTAG 840

AACCAGGCAA CTCAGGTTCA GCTGTATTTA ACAGTAAATA CGAaGTtGTA GGTGTTCACT 900

TTGGTGGAAA CGGCCCTGGA AATAAAAGTA CAAAAGGATA TGGTGTTTAT TTCTCTCCTG 960

AAATTAAGAA ATTCATTGCA GATAACACAG ATAAATAAAT CCTTACATAG ATAAATGATT 1020

TTAAAAATTA ACAACAAACT CAACaATTCA AATCATCTCT GTGATTCCAT TTATTCGAAA 1080

TGATTAAAAA AAATAAAACT TCAAAAAGCT AACATTATAA TTATACAAAT ACTTAGAGGA 1140

GCAGAAAAAT GaATAAAAAT ATAATCATCA AAAGTATTGC AGCATTGGAC GATTTTTAAC 1200
(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6444 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 
TGATAAGTCA TTTAAATTGT CACCTATTGA CATGACTTCT TTCATTTCAA TCCCTAATCT 60

TTCGGCAATT GTTTCTAGCG CAATACCTTT TTGTGCATCT GAATGCGTTA TTTCTATATT 120

TCCTCTCGAA GATGATGATA TAGCTAAATT CGGAGAkTCA GCTAAAATTT TACTAGCTTT 180

GTCAATTTTT TCTAAATTTC CATCAAATGC TAATATTTTC ATAATTAATT CACCAGGTAT 240

GTTTTCAATA GCATCATAAT TATCAACAAC TyTCAACGTA CCATTATCTA TGCGTCTTTG 300

AATACCATTT TTAATACGCT CAACGTTTGC ATGTTGACCT GCACGCTCAG CAATATCTAT 360

GTAAATGTCT AAATCTCTTT GTGGATCTTC AGTATAAATC GCACGACTCG TGTATACTTG 420

ATAATAAATA CCTGCATCTT TTAAAACATT TGTAATTTTG TGTACTAACG ATTTATTAAG 480

GTGTGAAGTG CTCATTACAT TGAAAGTTTC ATCACGTACT TCAGCACCAT TCAAACAAAT 540

ATATGGTACT GTTAAATCTG TGTCAGCAAC TGGTGCTTGk GCTTCATAAA ATGCTCGACC 600

TGTCGCGATA ACAACCGTTA TCCCTTGTTC TTGAGCGTAT TTAATCGCAT CAATATTAGG 660

TTGAGAAATT TCATGTGCTG CATTAAGTAG CGTGCCATCC ATATCAGTGG CTATTAGTTT 720

TATCATTATG TnACCTCGTT TCGTAAATnT AAAATCTTGT TCTTAAATAA GrATATATAC 780

TCAGCGCACA TACTTTtCTA TTAmCATTTA TATkGTCATT aATTTATCAT ATAATGTAAT 840

TCTaACAAAT nTTAAtTAGT ATGTACTATC GTCTAATTGG TGGATTTCTT ATTGGCTCTT 900

AAgTTTTTAA AAAATGTTGT TAATAATGTG CTACATGCTT CTTTAAGTAC ACCTTTATCA 960

ACAATTGCAC GATGATTAAA ATTAGATTGT TGCAATAAAT TCATTAAACT GCcACTACAA 1020

CCACCTTTAG GATCATCTGC GCCATAGACG ACTCTTGGAA TGCGACTCAT TACAATTGTT 1080

CCTGCGCACA TGACACATGG TTCTAAGGTT ACATATAATG TGCAACCTTC TAAACGCCAA 1140

CTACCTAACA CTTTGGCTGC ACGTTCAATT GCAATATGTT CAGCATGCGC CGTTGGTTGT 1200

TGTAGTGTTT CTCTTaAATT ATGTGCTCTA GCGATAACTT CATCATCTTT AGTGATGATA 1260

GCACCTATAG GTACTTCGCC TAGTTGAGCT GCTTTTTTAG CTTCTTCAAT CGCTAATGTC 1320

ATAAAATATA TATCATTTGT CATTTATGTC CAGATACCTC ACTTATGGTA CAATACTCAA 1380
CTATTGGCGT AGGTAAATCT TCACTTGCAC ACAAATTAAG TCAAACTTTA GATTTTTATG 1500

AAGAAAAAGA AATCATCACA GAAAATCCAT TTTTATCAGA CTTTTATGAA GATATCTCTA 1560

AATGGAGTTT TCAAACTGAA ATGTTCTTTT TATGCAATAG ATATAAGCAA TTTCAAGATG 1620

TAACACAACT AAATCAAGGT GTAGTTAGTG ATTATCATAT ACATAAAAAT AAGATATTTG 1680

CTAAAAATAC TTTGAGTTCT GTTGAATTTC AGAAATTCAG TAAAATTTAT GATATTTTAA 1740

CTGAAGATAT GATTATGCCG AATATGATTA TCTTTTTAGA TGCAGACCTT GATGTGTTAA 1800

AATCTAGAAT TGCTAAACGT AACCGTAGTT TTGAGCATCA AATAGAaGtG AtAcTGTaAg 1860

TTAAAAAAAG ATTATCGTGA GTATTATGAG TCCTTACAAA GTAATGGTTC AAATGTAGTT 1920

TTAATCGATA CnACTTCTAT TGATTTTCTT AAAAATGAAC AAGATTACGA AGATATATTA 1980

CATATTATAT TACCTATGAT AGGAGATATT ACCAATGAAT AATTACGGTA TTCCACAAAA 2040

TGCCATTATA ACCATTGCAG GTACAGTTGG TGTTGGAAAA TCAACACTAA sGCAAGCACT 2100

TGCAGATAAA TTAAACTTTA AAACGTCTTT TGAAAATGTC GAACATAATC CATATTTAGA 2160

TAAATTTTAC AGCGATTTTG AACGATGGAG TTTCCATTTG CAAATTTACT TCTTAGCTGA 2220

ACGTTTTAAA GAACAAAAGC GTATGTTTGA ATATGGTGGT GGCTTTGTCC AAGATCGATC 2280

AATTTATGAA GATGTTGATA TTTTTGCAAA AATGCATGAA GAAGAAGGCA CAATGAGTAA 2340

AGAAGATTTC AAAACATATT CAGACTTATT TAATGCCATG GTCATGACAC CTTATTTTCC 2400

TAAACCTGAT GTAATGATTT ATTTAgAATG TAACTATGAT GAGGTCATTG ATCGTATTAT 2460

TGAACGTGGT CGCGAAATGG AAATTAATAC AGACCCTGAA TACTGGAAAA AGCTATTTAA 2520

ACGCTATGAC GATTGGATTA ATAGCTTTAA TGCATGTCCA GTTGTACGTA TCAATATTAA 2580

TGAATATGAT ATCCATAAGG ACCCCGAATC TTTAAATCCT ATGATAAACA AAATTGCTCG 2640

AATTATTCAA ACATATCGAC AAGTAGATAC ACGATAAAAG ACTAAAGACA TAGCGTATAT 2700

GTTTATATTC AATGTATATT CCATAGATAT TATCGATTAT TTTATCAATT CTATCGAATA 2760

CATTAATTCA CATATACACT ATGTCTTTCT TTTTAATTTA AAGCTTCTAA AATATCTGCC 2820

GCACTATTTA AAATAATATC AGCTTCATGT AATTCTTCTT TTGTTGCAAT ACCTGTTAAT 2880

ACACCTATTG CCATACCTAA ATTTGCATTA CTTGCTGTCT TCATATCATT AGCAGTGTCT 2940

CCTACTATAG CTACTTTCTG AGGATCTACA TTATATTGCT CAAATAAAGG CGATAATACT 3000

TTAGGATTTG GCTTCTCATA GGCATCCGCT TCGGTAGAAA TGATCAAATC GAACAACGAG 3060

GTAGCATTGG TATGTGCTAA AAATTGTTCT ACACCTTTTT TAGTATCACT CGTAACAATA 3120

CCAAGTTGAT AGCCTTTTGC TTTCAAATCG ATAAGTGCTT CTTTAACACC TTCTACCCAA 3180
GTATCTTGTC CCGTCACATC ATTAAATGCC TGGATAATTT GTTGTAAAGA TCCTGAACCC 3300

ATCACTGATT TTGGATCAAT AGATTCTTTA ATGACACCGA GTTGTCTTAA AGCAGCTTCT 3360

TTATTATGTA CTGGGAAAGT CTCAAGCAAT GATTGTACAA ATCGTACCCC TATTTTTTCC 3420

CAACTTCTAT CAAATTCAAT TAACGTACCA TCTTTATCAA ATAATATCCA TTCCATTGaT 3480

ATCAATACTC CTATTTATTT ATTTCGTATT ATGCTGATTC TATGATATTC GTTATCCCCT 3540

GAAAATGAAC TCGTAGTATT GTTCTATTTA AATATTGaAT TAAATATAAT AATAAGTGAA 3600

ATCCCCTTCA ATACTTAACA ATAAACATTG TAAACTTAAT TTATTACCAT GCTTCGCTTC 3660

ATTGAAAGGG ATTTTAGTCA TGATTAACTT TTGCATATTG TTTTCATGAT TATATTCAAT 3720

TTTTATTAAT ATTTTGGTAC AACGACTCTC CAACCATTTT TATCTTCTAA AGTACCATTT 3780

TGAATACCAG TATAGACGTC GTATAATTTT TGAGTAATTT CACCAGTCTC ATTATTATTA 3840

ATAACGATTT CACGATCTTC GTATCTCAAT GTACCCACAG GTGAAATAAC TGCTGCAGTA 3900

CCACTACCAA ATACTTCTGT TAACTCACCT TTATCATATG ATTCGAATAA TTCATCGATT 3960

GAAACGCGGC GCTCTTCGAC TTCATATCCT AAGTTTTTAG CTAATTCGAT AATAGATTTA 4020

CGTGTAATAC CAGGTAAAAT ACTGCCATTC AACTCTGGTG TAATTACTTT GCCATTTTCA 4080

ACGAAGAAAA TGTTCATGCT ACCAACTTCT TCGATATATT TCTGTTCAAC ACCATCAAGC 4140

CATAATACTT GGTCATAACC TAATTTATTT GCATTAGTTT GTGCTAATAA ACTTGcCGCA 4200

TAGTTACCTG CAACTTTTGC AAAGCCTACA CCGCCACGaA CAGCACGCAC ATATTCATCT 4260

TCTACATAGA TTTTAGTTGG TTTTAAAGTT TCACCACCAT AATATGCACC TGAAGGAGAT 4320

AAAATAATTA ATAATTTATA CTGATGTGAT GCACCAACGC CAAGTGCCCC TTCTGTTGCA 4380

AAAACAAATG GACGAATATA TAATGATTGA CCTTCCCCTT CAGGAATCCA ATCTCTTTCA 4440

ATATCAACTA ATTGTTTTAG CCCCTCTAAC AATTCTGCTT CGTCTACTTG AGGCATTTCT 4500

AATCGTGCTA ACGAGTTATT AAGACGCTTA AAATTTTCTT CAGGACGGAA AAGTGCAACT 4560

TCCCCATCTC TTTTATATGC TTTTAATCCT TCGAATACCG ATTGACCATA ATGAACACCT 4620

TGTGCAGCAG GTGAAATTTC AATAGGACCA TAAGGTACTA TCTTCAAATC ATGCCATCCT 4680

TTATCTGCAT CATAATCATA ACTCAACATA TAATCAGTAA AATATTTACC AAAACCTAGT 4740

TGAGATGTAT TTGGTTTTTG TTTTAATGTT TCTCGTCGTT CAACTTTAAC TGCTTGTGAC 4800

ATGGTGATTG CCTCCTAATA ATATTGTATA AGAATTTGTT TAACTTAAAT TATAACAATC 4860

CaTATTTTGC TGTTCAACAA ATTTTCTAAA AATTCAAAAT TAATTAACAG ATTTCTAGAA 4920

AGACTATATC TTTTAGTATA AACGTATTAA TTTCACAGAG ACAAGTAATC TGTGTTTTAC 4980
TAAGTATAAT GAATAATATT AGAATTCATG CACTAGTTTA TTAAAATAAA GAGTAATTTA 5100

AAATATCATT CCGTGTATTA AAGTGAATGG AAATGATTAG TTATTATTTT TAACAGTATC 5160

TTTTTGTTCA ATAGCTTCTA ACATTAATTT AGTCATGCTC GCTAAATCAT ATTTAGGATC 5220

AAATCCCCAT TCGCCACGTG AACAGCTTGT ATCAATAGAA TCCGGCCAAC TATTAGCGAT 5280

ACCTTGTCTA ATAGGATCAA CATCGTAATC TAATGTAAAA TTGGGATAGT ATTCTTGAAT 5340

TGCTTCTTTT ACCATCTCTG GATCAAAACT CATTGCGCTC AAATTATAAC CATTTCTAGT 5400

TTCTAATTTA GCGTCGTCTG CTTCCATAAG TTTAATAATT GCTTCAATTG CATCATCCAT 5460

ATACATCATA TCCATATACG TGCCTTTATC TATGAAGCTT GTATAATGAC CCTCTCTTAC 5520

TGCTTTGAAG TATATTTCAA CAGCATAGTC TGTAGTACCG CCACCTGGCT CTTTAACATG 5580

CGAGATTAAA CCTGGGAATC TAACACTTCT TGTATCTACA CCAAAACGTT TGAAATAGTA 5640

TTGACACAAT AATTCTCCAG CTACTTTATT TACACCATAC ATTGTCGTAG GTTGCTGAAT 5700

CGTTACTTGT GGCGTATTAA CTTTAGGAGT TGAGTCTCCA AATGCACCAA TTGAACTTGG 5760

TGTGAAAAAG TGCAAATTAT AAGTTCTTGC AGCTTCTAAT GCATTCATTA ATCCACCCAT 5820

ATTTAAATCC CAAGCTAGAA TTGGATTTTT CTCAGCAGTT GCTGATAATA ATGCTGCCAT 5880

ATGCATTAGA CTATCCGCTT CAAAGTCCCT AACTAACTCA AACATACGGT CACGATCTGT 5940

7ACGTCTAAG ATTTCAAATG GTCCATTTTG TACAGGTGAG TCTGCTTCAG GTTCCCTAAT 6000

ATCTGTAGCA AGAACATTAT CTGTCCCATA AATTTCTCTG CACTTAACAA CTAATTCTGT 6060

ACCAATTTGT CCTAATGCAC CAGTAATCAT AATTTTTTTC ATAGAAATAT CTCCTTTGtC 6120

TCTTCTATAT AGCTATAGTC CATCACAAGC GGaCATAATA TTCATTTTCA TAATAATTAT 6180

AATATAAAAG CGCTTTCTTG TATATATGAC ATGTACATGT TGCTGATATk TCTGTAAATG 6240

GAAATTCTAG TTGTATTAAT TGATTTTAGT AATTTATAGC GTTTATTATT GCTAATTACT 6300

GtCAAATTAA ATTTTTTATC CCTCAACTCT TAAACTCTGG ATATCTTTCA TTATATTAGC 6360

TTTTTTATAA CCATGGATAT CATGTAAAGC CTTATAAGCn TTAAATAATG TTTCATACCT 6420

TTGTACTTnT TCCGCTTCTG GATT 6444
(2) INFORMATION FOR SEQ ID NO: 234:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4721 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear
GCCCATGAGA CAATTTTACT TGCTTTTCCC ATTGGTTATC ACGTTCTTAT TACATAGATT 60

TAAACCGAGA AATATTATTC AAACGCTATT TATTGTATCG TTGATTTCTT TAGGACTTAT 120

GATAGTGATT CATTTCATCA CTGGAGATAA TTCACGTGTG TATTTTGGGA CAGATACACG 180

ACTGCAAACT TTATTGCTTG GTTGTATATT AGCATTTATT TGGCCTCCGT TTGCTTTGAA 240

AAAAGATATT TCTAAAAAGA TTGTCGTATC ATTAGATATT ATAGGGATAT CTGGTTTTGC 300

GGTTCTAATG ACTTTGTTCT TTATAGTTGG AGACCAAGAT CAATGGATCT ATAATGGAGG 360

ATTTTACATT ATATCATTTG CAACTTTATT CATTATTGCA ATTGCGGTAC ATCCTTCTAG 420

TTTATTTGCT AAATTTTTAA GTATGAAACC TTTACTAATT ATAGGTAAAC GATCATATAG 480

CTTATACTTA TGGCATTATC CTATCATTGT TTTTGTGAAC AGTTATTACG TACAAGGACA 540

AATACCGGTA TACGTTTATA TTATAGAAAT TTTGTTAACA GCGTTAATGG CTGAAATTTC 600

GTATCGCTTT ATTGAAACAC CTATACGTAA AAAAGGATTT AAAGCTTTTG CATTTTTACC 660

TAAAAAGAAG GGGCAATTTG CTAGAACAGT GTTAGTTATC CTATTATTGG TTCCGTCTAT 720

CGTTGTGCTC AGTGGACAGT TTGATGCACT TGGCAAACAA CATGAAGCCG AGAAGAAAGA 780

GAAGAAGACG GAATTTAAAA CAACGAAGAA AAAAGTCGTT AAAAAAGATA AGCAAGAGGA 840

TAAGCAGACA GCGAATAGCA AAGAGGATAT TAAAAAGTCA TCACCACTAT TAATTGGTGA 900

CTCGGTCATG GTGGATATTG GTAATGTCTT TACTAAGAAA ATACCAAATG CACAAATTGA 960

TGGTAAAGTT GGACGGCAAC TCGTTGATGC TACACCAATT GTGAAATCGC AATATAAAGA 1020

CTATGCTAAA AAAGGTCAAA AAGTTGTAGT AGAGCTTGGT ACAAATGGGG CATTTACGAA 1080

AGATCAATTA AATGAACTAT TGGATAGTTT TGGAAAAGCA GACATATATT TAGTTTCTAT 1140

TAGAGTACCT AGAGATTATG AAGGTAGAAT AAATAAATTA ATTTATGAGG CAGCTGAAAA 1200

GCGCTcTAAT GTACATCTAG TCGATTGGTA TAAAGCTTCT GCAGGTCATC CGGAATACTT 1260

TGCATATGAC GGTATTCACT TAGAATATGC AGGTAGTAAA GCGCTGACTG ATTTGATTGT 1320

AAAAACGATG GAAACACATG CTACAAATAA GAAATAATTT GATGCACTAA ACTTTTGAAA 1380

TATTACATTA CTTCTGATAT TTATTATCAA AAATGATGTA TTTCATTAAA AGTTTAGTGC 1440

TTTTTTATTT TCAAATCCCA TAGTAACGGT GCAGAAAAAG TGTTGTAAAC ATTCTAATTG 1500

GTATATTACA TTCAATGAAG CTTTATTAGG AACAGATTAC ATTATGATAA CAAAGCCCGC 1560

AAGACACCTA ATCTCTGTTA TAGTTTGTTT TGTCGCAAAA CTATAAAAGT TATAATTGTT 1620

TGCATACTAA AAAAATAAAA AATATAAAAT TTAAAATAAT TGAGTCGCTA ATGACTATAT 1680

GTATAGAGTG TTTTGATTAT TGGGAGGATA TTTAATTATG AAAAAAATCG TTACAGCTAC 1740
ACAAAATAAC AATGGATATA ATTCTAATGA CGCTCAATCA TACAGCTATA CGTATACAAT 1860

TGATGCACAA GGTAATTATC ATTACACTTG GACAGGAAAT TGGAATCCAA GTCAATTAAC 1920

GCAAAACAAC ACATACTACT ACAACAACTA CAATACTTAT AGTTATAACA ATGCATCTTA 1980

CAATAACTAC TATAATCATT CATATCAATA CAATAACTAT ACAAACAATA GTCAAACAGC 2040

AACAAATAAC TATTATACTG GTGGTTCAGG TGCAAGTTAT AGCACAACAA GTAATAATGT 2100

TCATGTGACT ACAACTGCAG CGCCATCTTC AAATGGTCGT TCAATTTCTA ATGGTTATGC 2160

ATCAGGAAGT AACTTATATA CTTCAGGACA ATGTACTTAT TATGTATTTG ATCGTGTTGG 2220

TGGGAAAATT GGTTCAACAT GGGGTAACGC AAGTAATTGG GctAACGCAG CTGCATCATC 2280

TGGCTATACA GTGAACAATA CACCAAAAGT TGGTGCTATC ATGCAAACAA CACAAGGCTA 2340

TTACGGTCAT GTTGCTTACG TTGAAGGCGT TAACAGCAAC GGTTCTGTTC GTGTTTCAGA 2400

AATGAACTAT GGACATGGTG CTGGTGTGGT TACGTCTCGT ACAATTTCAG CAAACCAAGC 2460

AGGTTCATAT AATTTCATTC ATTAATCAAA TGTAAATCAA ATGACGTCAA TATTCTCTAA 2520

CATGAGAGTA TTGGCGTTTT TGTTTTATAT AAATATAAAT GAGAGCGGTT TATTCACTGA 2580

TCTTTAGGGA ACTAAGTAAT AAAGTGATAA TTTATACTAT GTCAGTATGA TTGAGAGTGA 2640

TTCAATTTAG ATGAAAACCA TGAAAAAATA TATTAAAACA GCATTTTTTT GTAGTATGTA 2700

TTGGTTAATT GTTCAACTAA ATATAGCAAA TTTAGGTACA AGAATTCCTG ATAAGTATTT 2760

TCGTCAGAAG TACATAATAT TTAAATCATT TAACTTTGAG AAGCATGGAA AATTTTGGAA 2820

CAAATGGTTT TACGTAAGAA AATGGAAACA TAAGATTTTA GATGGTCATC AGCTTAATCA 2880

AAATATATAT GATCAGCGTC ATTTAATGAC AATCAATACT GATGAAATTG AAAAAATGAT 2940

TATAGAGACA AAGAGGGCAG AGTTGATTCA TTGGATATCG ATACTTCCAG TCATCATATT 3000

CAATAAAGGC CCTCGTTTAG TAAAGTATAT AAATATTTTC TATGCAATGA TAGCTAATGT 3060

TCCAATCATT ATTGTGCAAC GCTATAATCG ACCGAGATTA ACGCAGTTAC TACGCATATT 3120

AAAACGAAGA GGTGAACGTC ATGACTAAAC ATATCATCGT TATTGGTGGT GGCTTAGGTG 3180

GGATTTCTGC AGCAATTCGA ATGGCACAAA GTGGCTATTC GGTCTCATTA TATGAACAAA 3240

ATAATCATAT AGGAGGCAAA GTGAATCGTC ATGAATCAGA TGGCTTTGGC TTTGATTTAG 3300

GTCCATCTAT TTTAACGATG CCTTATATTT TTGAAAAATT ATTCGAATAT AGCAAGAAGC 3360

AAATGTCAGA CTACGTTACA ATCAAGCGAT TGCCACATCA ATGGCGTAgC TTTTTTCCAG 3420

ATGGAACGAC TATCGATTTG TATGAAGGTA TTAAAGAAAC AGGTCAGCAT AATGCGATAT 3480

TGTCGAAACA GGATATAGAG GAACTGCAAA ATTATTTGAA TTATACAAGA CGAATCGATC 3540
TTCATGGGCC ATTAAATGCT CTTATTAATT ATGATTATGT ACATACTATG CAACAGGCCA 3660

TAGACAAGCG TATCTCGAAT CCATACTTGC GACAAATGTT AGGCTATTTT ATCAAATATG 3720

TAGGTTCTTC ATCATACGAT GCGCCAgCTG TATTATCTAT GTTATTCCAT ATGCAACAAG 3780

AGCAAGGCCT TTGGTATGTA GAAGGTGGAA TCCATCATTT AGCCAATGCC TTGGAAAAGc 3840

tAGCGCGTGA AGAAGGTGTC ACAATTCATA CAGGTGCACG TGTGGACAAT ATTAAAACAT 3900

ATCAAAGACG TGTGACGGGT GTCAGATTAG ATACAGGTGA GTTTGTAAAG GCAGATTATA 3960

TTATTTCAAA TATGGAAGTC ATACCTACTT ATAAATATTT AATTCACCTT GATACTCAAC 4020

GATTAAACAA ATTAGAGAGG GAATTTGAGC CGGCAAGCTC AGGATATGTG ATGCATTTAG 4080

GTGTTGCTTG CCAATACCCG CAATTAGCAC ATCATAATTT CTTTTTTACG GAAAATGCTT 4140

ATCTCAATTA TcAACAAGTT TTTCATGAAA AGGTATTGCC AGATGATCCG ACCATTTATC 4200

TAGTAAATAC GAATAAAACT GATCACACAC AAGCGCCAGT AGGTTATGAA AATATCAAAG 4260

TCTTACCACA TATTCCATAT ATTCAAGATC AGCCTTTTAC CACTGAAGAT TATGCGAAGT 4320

TTAGGGATAA AATTTTGGAT AAATTAGAAA AAATGGGACT TACTGATTTA AGAAAACACA 4380

TTATTTATGA AGATGTTTGG ACACCGGAGg ATATTGAAAA AAATTATCGT TCTAATCGTG 4440

GTGCAATATA TGGTGTTGTA GCAGATAAAA AGAAAAACAA AGGATTTAAA TTTCCTAAAG 4500

AAAGTCAGTA TTTTGAAAAC TTGTACTTTG TAGGTGGATC AGTAAATCCT GGTGGTGGCA 4560

TGCCAATGGT TACATTAAGT GGGCAACAAG TCGCAGcAAg ATAAACGCGC GAGAAGCGAA 4620

GAATAGGAAG TGATATCTAT GAAATGGTTA TCACGAATAT TAACAGTAAT AGTGACCATG 4680

TCTATGGCGT GTGGTGCaTT GATATTTaAT CgTAGACATC A 4721
(2) INFORMATION FOR SEQ ID NO: 235:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

TATTCGTGCG CAATGGGCTA AATTAGGTCT AGGTTTAGAT TATAGTAGAG AACGTTTTAC 60

TTTAGATGAA GGTTTAAGTA AAGCAGTTAA AAAAGTTTTT GTTGATTTAT ACAATAAAGG 120

AATTATTTAT CGTGGCGAAC GTATTATAAA TtGGGATCCn AAAGCACGTA CAGCTTTATC 180

TGaTATTGAA GTAATACATG AAGATGTTCA AGGTGCGTTT TATCATTTTA AATATCCTTA 240
TACAGCGATT GTTGTTAACC CTAATGACGA ACGATACAAA GATGTAATCG GTAAAACTGT 360

TATATTACCA ATCGTAGGAC GCGAACTGCC TATTTTAGCA GATGAGTATG TTGATATAGA 420

CTTCGGTTCT GGTGCTATGA AAGTGACACC AGCACATGAC CCTAATGATT TTGAAATTGG 480

TCAAAGACAT CAATTAGAAA ATATTATCGT TATGGATGAA AATGGTAAAA TGAACGACAA 540

AGCGGGTAAA TATGAAGGTA TGGACCGTTT TGATTGTCGT AAACAGCTAG TTAAAGATTT 600

AAAAGAACAA GATTTAGTTA TCAAGATTGA AGATCATGTT CATTCTGTAG GTCATTCAGA 660

ACGATCTGGC GCTGTTGTTG AACCATATTT ATCAACACAA TGGTTTGTGC GCATGGAAGA 720

CTTAGCGAAA CGTTCATTAG ATAACCAAAA AACAGATGAT CGTATTGATT TTTATCCGCA 780

ACGTTTCGAA CATACATTTA ACCAATGGAT GGAAAATATT AGAGATTGGA CGATTTCAAG 840

ACAATTATGG TGGGGTCATC AAATTCCGGC TTGGTATCAT AAAGAAACAG GCGAAATATA 900

TGTTGGAGAA GAAGCGCCAA CTGATATTGA AAATTGGCAA CAAGATGAAG ATGTATTAGA 960

TACGTGGTTC TCaAGTGCTT TATGGCCtTT CTCyACGTTA GGTTGGCCTG ATTTAGAAAG 1020

TGAAGACTTT AAACGATACT ACCCAACAAA TGCCTTAGTT ACAGGTTACG ATATTATCTT 1080

TTTCTGGGTA GCACGCATGA TATTCCAAGG CTTAGAATTT ACAGATCGTC GTCCATTTAA 1140

TGATGTATTA TTACACGGTT TAGTTCGTGC TGAAGACGGG CGTAAGATGA GTAAATCATT 1200

AGGTAATGGT GTGGATCCAA TGGATGTTAT TGACGAATAC GGTGCTGATA GCTTGCGTTA 1260

CTTCTTAGCA ACAGGTTCAT CTCCAGGACA TGATTTAAGA TACTCAACTG AAAAAGTTGA 1320

GTCAGTGTGG AACTTTATCA ATAAAATCTG GAATGGGGCA CGTTTCAGTT TAATGAATAT 1380

CGGTGAAGAC TTTAAAGTTG AAGATATCGA TTTAAGTGGT AACTTATCAT TAGCAGATAA 1440

ATGGATTCTA ACACGTTTAA ATGAAACGAT TGCAACAGTT ACTGATTTAA GTGACAAATA 1500

TGAATTCGGC GAAGTTGGAC GTGCATTATA TAATTTCATT TGGGATGATT TCTGTGATTG 1560

GTACATTGAA ATGAGTAAAA TTCCAATGAA TAGTAATGAT GAAGAACAAA AACAAGTTAC 1620

ACGTTCAGTA TTGAGTTATA CTTTAGACAA TATTATGAGA ATGCTACATC CATTCATGCC 1680

ATTTGTAACA GAGAAAATAT GGCAAAGTTT ACCACATGAA GGTGACACAA TTGTTAAAGC 1740

TTCATGGCCA GAAGTGCGTG AATCATTGAT TTTTGAAGAA AGTAAACAAA CAATGCAACA 1800

ACTTGTTGAA ATCATTAAAT CTGTAAGACA ATCACGTGTA GAAGTAAATA CGCCATTGTC 1860

TAAAGAAATA CCTATTTTAA TTCAAGCTAA AGATAAAGAA ATTGAAACAA CTTTATCACA 1920

AAACAAAGAT TATTTAATCA AATTCTGTAA TCCTAGTACC TTAAATATTA gCtGACGTGG 1980

AAAwTCCTGA GAAAGCAATG ACaTCAGTTG TAATTGCAGG TAAAGTGGTA TTACCATTAG 2040
AAAGCGAATT AGATAGAGTA GATAAAAAGC TCTCTAATGA AAACTTTGTA AGTAAAGCAC 2160

CTGAAAAGGT TATAAATGAA GAAAAACGTA AAAAACAAGA TTATCAAGAA AAATATGATG 2220

GTGTGAAGGC AAGAATTGAA CAATTAAAAG CATAGGAGTT AGTAACAATG AATTACCTAG 2280

AGAGCTTGTA TTGGATACAC GAAAGAACTA AATTTGGCAT CAAACCAGGT GTTAAACGTA 2340

TGGAATGGAT GCTAGCACAA TTTAATAATC CTCAAAATAA CATTAAGGGT ATTCATGTAG 2400

GTGGCACAAA TGGTAAAGGC TCTACAGTTG CTTACCTTAG AACAGCTTTA GTTGAAAATG 2460

GTTATGAAGT AGGTACATTT ACGTCGCCGT TTATTGAAAC ATTTAATGAA CGAATTAGTC 2520

TAAATGGTGT GCCAATATCA AATGACGCTA TTGTAGAATT AGTATCACGT ATTAAACCAG 2580

TAAGTGAAAT GATGGAACGT GAAACAGATT TAGGTGTTGC AACTGAATTC GAAATAATCA 2640

CAGCGATGAT GTTTTTATAT TTTGGTGAAA TACATCCTGT TGATTTTGTC ATTGTTGAGG 2700

CTGGATTGGG TATAAAGAAC GATTCGACAA ATGTCTTTAC ACCGGTTTTA TCAATCTTAA 2760

CTAGTATCGG TCTAGACCAT ACAGATATTT TAGGTGGTAC TTATCTAGAT ATTGCTAGGG 2820

ATAAAGGCGC GATTATAAAG CCTAACGTTC CAGTGATATA TGCTGTTAAA AATGAAGATG 2880

GATTAAAATA TGTTCGTGAA CGCGCAATTG AACAACATGC AAAGCCAATT GAATTAGATA 2940

GAGAAATTGT TGTTGTATCG CAAAATGATG AATTTACTTA CCGTTATAAA GATTATGAAT 3000

TAGAAACAAT CATTTTAAGC ATGTTAGGTG AACATCAGAA ACAAAATGCT GCATTAGCCA 3060

TAACAGCTCT TATTGAATTA AATGAACAAG GATTAATTGA ATTAGATTTC AATAAGATGA 3120

TAGACGGTAT TGAATCAGTT CGTTGGACTG GACGTATTGA GCAGGTGCAT GACAAACCTT 3180

TAATCATTTT GGATGGCGCA CATAATTCAG AGAGTATAGA TGCTCTAATT GATACAATTA 3240

AACAGTACCA TGATAAAGAA AAAGTAGATA TTTTGTTCTC AGCAATAAAC GGAAAACCGA 3300

TTAA&AGAT GGTCAAACAT TTAAGTTTAA TTGCGCATAC GTTTTATGCA ACTGAATTTG 3360

ATTTTCCGAA AGCGTTACGC AAAGAAGAAA TTGTAGGTAG TATTGAAAAT GATGAAATAC 3420

AATTAGTAGA TGACTACGTT GAATTTATAA AAAATTATCA AGGTGATACA TTAGTAATTA 3480

CCGGTAGTCT GTATTTCATA AGTGAAGTTA AATCAA 3516

(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7481 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 
TGAGTGATAG AATCAAAAAA GCCATCTCAA AAATTAATCA AGCAAACAAC ATTCCAAACA 60

ATGsTCGCAA ATCACCAATG TATCACTCTC CAATTACGTA ACTATGATTT AATTTAAGCA 120

TAGTTATTGA GGTTTTGTGA TATATAGTAT AAAATTAATG AGAATTAAAT TTAATAATGT 180

AAAATTCATm TTCgGGGTCG GGTGTAATTC CCAACCGGCA GTAAATAAAG CCTGCGACCT 240

GCTAGTATGT ATCATATTAG TGGCTGATCT AGTGAGATTC TAGAGCCGAC AGTATAGTCT 300

GGATGGGAGA AGATGGAGGT TTTTTGTTGT GCAATAATCC TCCTATTCTT ACGAGATGAA 360

TGGAAGGAGA AAATTGAATA TGCAACAAAA TAAACGTCTT ATCACAATAA GTATGTTGAG 420

CGCGATTGCG TTTGTGTTAA CTTTTATCAA GTTTCCTATA CCATTTTTGC CACCATACTT 480

AACTTTAGAT TTTAGTGATG TACCGTCACT ACTAGCTACA TTTACGTTTG GACCAGTTGC 540

CGGTATCGTA GTTGCACTGG TTAAAAATTT ATTGAACTAC TTATTTAGTA TGGGCGATCC 600

AGTTGGACCA TTTGCTAACT TTTTAGCAGG CGCAAGTTTC TTATTAACTG CTTACGCCAT 660

CTATAAAAAT AAACGTTCAA CAAAATCTTT GATTACTGGA TTAATCATTG CAACAATCGT 720

TATGACTATC GTGTTGAGTA TTTTGAACTA TTTCGTTCTA TTACCTTTGT ACGGTATGAT 780

ATTTAACTTA GCTGATATCG CAAATAATCT TAAAGTAATC ATTGTTTCAG GAATTATACC 840

ATTCAATATT ATTAAAGGTA TCGTTATTTC TATTGTATTT ATTTTACTAT ATAGAAGGCT 900

TGCGAATTTC TTGAAAAGAA TTTAATCAAA TTAAAGCAAA ATAATATACA CATAATAATA 960

AAAAGCAGGT GACTATCAAT AAACGATAGC TTGCCTGCTT TTTCTATAGA ACATTTGTCT 1020

AAAAAATCAA TTATTCaAAT TTTAAAGCGT CCCCATCAAA TGATTCGTCT GCAATTTTAA 1080

TAGAATCTGt AGGGCATCCa TCAATTGCAT CTTCCATATC TTCATATAAT TCCtCAGGTA 1140

CTTCTGCAGT ACCTTGGTTA TCGTCAAGGA TTACGAAAGC AATACCTTCG TCGTCGTAAT 1200

CATATATATC TGGCGCTGCT GCACCGcATG CACCACATGC AATACAAGTA TCCATATCAA 1260

CGATTGTATA TTTTGCCAAT GTCTTCGCCT CCTTTGATAA AAATGCTAAA ATAGTAATGT 1320

GACTAAAATT TTAGACAGCA TCATTTTTAT TTTCAAATTA TCCGTTTTAC AGAGTGAGGG 1380

TTAAATTTGC AACACATTAT AAAAACAGCA TTACAACAAA CATTTAACTA TAAAACAAAT 1440

AAAAGTATTT ACAATATCTT AGTTGGTAAG AAATCTCACC AAACCTTTTT TGACGCTTGT 1500

AGTCAACAAC AGTTGTCATT ATATCACAGT TTACCACTAT TAAAATATCC GTCTTTTGAG 1560

CTATTTtTAG AAAAAATCAA TGAATTTAAT GCTGAAATGG AAATCATGTT GCATCCTAGA 1620

TATACATTTG aAAGCATGGG GCAAACATTT CAAGCAATTC AACTATTAGT GCAAaCCATG 1680

TCTAATACCA AACAACATGT TTTTCATTTT GTACCAATCT CTCAAAATAA TAAGATACAm 1740
AATGAACTAC ATAACTTGTT TAAAGCAATA ACTTTAAAAG GGCCATGTTA CTTACATTAT 1860

TATTTGCAAG GCTATGATGA ACCAATGTAT ACGAGACAGC AAGTTAGTTT AATAGAAAAG 1920

CTATCTCAAC AGCAATTGTT TGAATACGAA ATGAATAATT TAGTGACAAT GATGTTTGAA 1980

TTAGAAAGTG GAGAATATAC TATTTTATCA AAAATAATAA TGAAACCTAC ATTATTAAAT 2040

CAAACTTATA TTACTTATAC AAAATTGCTT GAACAATTCA CGATGGAAGA TATAGCGGCT 2100

CAACAACAAG TTAAAATCAA TACTATCGAA GATCATGTAC TTGAAATCTT AATCAAAGGT 2160

TACATGTCTA ATTACGATGA TTATGTTGAA CTAGAAGATC AACTCCAGTT TTTGAATTTT 2220

TATCAACAGC ATCGTGGCGA ACGATTAAAA TTTTACAAAG AACAATTTGA CACGTTATCA 2280

TATTTTCAAT TAAAAGTATT AATCGTTGGA TTTGAAAGAG GTGATCTGAA TGTTGCATGA 2340

TATTTTACGA AACAAATTTG GATTCGAGAG TTTTAAACCG GGACAACAGG AAATTATAGA 2400

AAGTATAATG TCTCAACAAC ACACTCTAGG TATACTTCCA ACTGGAAGTG GAAAGAGTTT 2460

GTGTTATCAA ATACCTACGT ATTTATCAGG TAAGCCGACA TTAATTATCT CACCGTTAAT 2520

ATCTTTAATG GATGACCAAG TTATGCAGTT GAAAATAAAT GGAGAAAAAC GTGTAACATG 2580

TATTCACTCT GGTATGGATG AAATTGAGAA AAAGCATAAT ATTAAATGTT TACGACATAG 2640

CCGCTTCATC TTTCTAAGTC CAGAATTTCT CCTGCAACCG TCAAATTTTA AATTAATATC 2700

TATGATAGAC TTTGGCATGA TTGTTCTAGA TGAAGCACAT TGCCTATCTG AATGGGGATA 2760

TGATTTCAGA CCACATTATG CTCTAATAGG AAAAGTAACA AAGCATTTTA AAGAAGCGGT 2820

TGTCTTAGCA TTGACAGCAA CTGCACCACC GCATTTACAA GATGATTTGA CGGAAATGTT 2880

AGCGATTCAA TTCAATGTTA TTAAAACTAC AATGAATCGC CCAAATATAA GCTTTAAGCA 2940

TCTTAATTTT CATGATGATG AAGATAAAAT TGAATGGTTG CTGCCGTTTC TACAACAGTC 3000

GGGAGCAACG ATTATTTATG TCTCATCGAA AAAGATGTGT CTGAATTTAG CGCAACTTAT 3060

TTATGATTCA GGTTTTCTTA CAGGTATTTA TCATGGTGAT ATGAATTATC AAGAGCGACA 3120

CACAGTTCAA CAACAATTTT TAAATAATGA TATTCCGATT ATAGTCGCAA CGAGTGCTTT 3180

TGGTATGGGA ATTAATAAAA AAGATATTCG CACAATCATT CACTTTCATC TTTCAACAAG 3240

TCCTTCTAAC TACATTCAAG AAATTGGCCG TGCGGGTCGC GATGGTGAAC TAAGTCAGGC 3300

AATTAGTTTA TTCCAACCGG ACGATAAATA TATTTTAGAA ACGTTATTAT TTGCAGATAT 3360

GATAACAGAA GAAGATGTAC AAAATTTCGA AATAGGAGAA TTTTTAGCTC CCGATAAACA 3420

AGCCGTTTTG ACAACGTTGc AATCATTCTA TAGTATCGGC GCCTTGaAAC AGATATTTAA 3480

GCAATCATTT AAACGAAAGC AATTAGGATT CTTTCGCATG ATTGGCTATT GCAAATTGGA 3540
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ATGTTGTGAC AATGATTCTA ATATAACTGA TATCGCAATT TTAAATAAGA AGAAGGTAAT 3 6 60 

TAGAAGTATT GGATTTGATG AAAAGTTGCA AAATTTATTT CTCAGATAGT ATTACTTTAC 3 72 0 

5 

TAAAAGAAAA TTGACAAGCT ATAATTAGTG TATACACAAT TGAAAAATGA TTGAAATAAT 3 780 

TTTGAAAAAT ATACATAAAC ATATGTCATG TGGGTATATT TTATGTAAAA TCATTGTAAT 3 84 0 

AGAATAGAAA GGAAGATGGC TATGTCTAAT AATTTTAAAG ATGACTTTGA AAAAAATCGT 3 900 

10 

CAATCGATAG ACACAAATTC ACATCAAGAC CATACGGAAG ATGTTGAAAA AGACCAATCA 3 96 0 

GAATTAGAAC ATCAGGATAC AATAGAGAAT ACGGAGCAAC AGTTTCCGCC AAGAAATGCC 4 020 

15 CAAAGAAGAA AAAGACGCCG TGATTTAGCA ACGAATCATA ATAAACAAGT TCACAATGAA 4 080 

TCACAAACAT CTGAAGACAA TGTTCAAAAT GAGGCTGGCA CAATAGATGA TCGTCAAGTC 414 0 

GAATCATCAC ACAGTACTGA AaGTCAAGAA CCTAGCCATC AAGACAGTAC ACCTCAACAT 4 2 00 

20 GAAGAGGAAT ATTATAATAA GAATGCTTTT G CAATGGAT A AATCACATCC AGAACCAATC 4 260 

GAAGACAATG ATAAACACGA TACTATTAAA AATGCAGAAA ATAACACTGA GCATTCAACA 4320 

GTTTCTGATA AGAGTGAAGC TGAACAATCT CAGCAACCTA AACCATATTT TACAACAGGT 4 3 80 

25 

GCTAACCAAT CAG AAA CATC AAAAAATGAA CATGATAATG ATTCTGTAAA ACAAGATCAA 44 4 0 

GATGAACCTA AAGAACATCA TAATGGTAAA AAAGCAGCAG CTATTGGTGC TGGAACAGCA 4 500 

GGTGTTGCAG GTGCAGCTGG TGCAATGGCT GCTTCTAAAG CTAAGAAACA TTCAAATGAC 4 560 

GCTCAAAACA AAAGTAATTC TGGCAAGGCG AATAACTCGA CTGAGGATAA AGCGTCTCAA 4 620 

GATAAGTCTA AAGATCATCA TAATGGCAAA AAAGGTGCAG CGATCGGTGC TGGAACAGCA 4 6 80 

35 GGTTTGGCTG GAGGCGCAgC AAGTAAAAGT GCTTCTGCCG CTTCAAAACC ACATGCCTCT 4 740 

AATAATGCAA GCCAAAACCA TGATGAACAT GACAATCATG ACAGAGATAA AGAACGTAAA 4 80 0 

AAAGGTGGCA TGGCCAAAGT ATTGTTACCA TTAATTGCAG CTGTACTAAT TATCGGTGCA 4 86 0 

40 TT AG CG AT AT TTGGAGGCAT GGCATTAAAC AATCATAATA ATGGTACAAA AGAAAATAAA 4 92 0 

ATCGCGAATA CAAATAAAAA TAATGCTGAT GAAAGTAAAG ACAAAGACAC ATCTAAAGAC 4 98 0 

GCTTCTAAAG ATAAATCAAA ATCTACAGAC AGTGATAAAT CAAAAGAGGA TCAAGACAAA 504 0 

45 GCGACTAAAG ATGAATCTGA TAATGATCAA AACAACGCTA ATCAAGCGAA CAATCAAGCA 5100 

CAAAATAATC AAAATCAACA ACAAGCTAAT CAAAATCAAC AACAGCAACA ACAACGTCAA 5160 

GGTGGTGGCC AAAGACATAC AGTGAATGGT CAAGAAAACT TATACCGTAT CGCAATTCAA 5220 

TACTACGGTT CAGGTTCACC GGAAAATGTT GAAAAAATTA GACGTGCCAA TGGTTTAAGT 5280 

GGTAACAATA TTAGAAACGG TCAACAAATC GTTATTCCAT AATATAACTA TATAAATTGT 534 0 
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TAAATTG CGC TTATAAGTAT GTAGCGGTTT TTTCATTTTT CAAAGTTTGT TATTTAACAA 54 60 

GGTCTTGTCT CGAATATTGG CAT AT CAATT TAACTTTTTA AATAGTCATC AAAAAGATAA 5520 

5 AACACCACAA TCAACAAATT TAACGAGGAA GAATAAAAAA TAAATCAACA TATTAAATTG 55 80 

TAGTGTTATT CAACTCCGTA GCTAACAATT CTCTATTCAC ATTAAACAAA TTGTCAAAAA 564 0 

TATATCATAA ATCTTCAAGC ACAGACTTAG CGCATCAATC ACTGAACTGT TATAATAGTT 5700 

TGGGATTAAA GGAGGCCGAA ACAATGCAAA AAGTTGAAAG TATCATAATT GGTGGAGGGC 576 0 

CATGCGGATT AAGTGCGGCT ATTGAACAAA AAAGAAAAGG TATTGATACC TTAATTATTG 5820 

AAAAGGGTAA TGTCGTTGAA TCAATCTACA ATTATCCTAC TCACCAAACA TTTTTCTCAT 58 80 

CAAGTGATAA ATTAAGTATT GGGcGAgTAC CGTTTATCGT TGAAGAAAGT AAACCAAGAC 594 0 

GTAATCAAGC GCTAGTTTAT TACCGAGAAG TTGTAAAACA TCATCAATTA AAAGTAAATG 6000 

20 CATTTGAAGA AGTATTAACT GTTAAAAAAA TGAATAATAA ATTTACTATT ACTACGACGA 6060 

AAGATGTTTA TGAATGTCGA TTTTTAACAA TCG CGACAGG CTATTATGGT CAGCATAATA 6120 

CATTAGAAGT TGAAGGTGCG GATTTACCTA AAGTGTTCCA TTATTTTAAA GAGGCACATC 6180

25 CGTATTTTGA TCAAGATGTT GTAATTATCG GTGGTAAGAA TTCGGCTATC GATGCTGCTT 624 0 

TGGAGTTGGA AAAAGCTGGT GCTAACGTGA CGGTTCTATA TCGTGGTGGA G ATT ATT CG C 63 00 

CTTCAATTAA ACCGTGGATA CTTCCAAATT TCACAGCATT AGTAAATCAT GAAAAAATTG 63 60 

ACATGGAA r rr TAATGCTAAT GTTACCCAAA TAACTGAAGA TACTGTGACT TATGAAGTAA 6420

ATGGTGAAAG TAAAACGATA CACAATGATT ATGTATTTGC GATGATTGGT TATCATCCCG 64 80 

ATTATGAATT TTTAAAATCT GTAGGCATTC AAATTAATAC AAATGAATTT GGAACAG CGC 654 0 

CTATGTATAA TAAAGAAACA TACGAAACAA ATATCGAAAA TTGCTATATT GCAGGTGTAA 6600 

TTGCTGCAGG GAACGATGCG AATACCATTT TTATTGAAAA TGGTAAATTC CACGGGGGCA 6660 

TTATTGCTCA AAGCATGCTA GCTAAGAAAC AAACGCCCTT AGAATCATAA AAATAAAGGT 6720 

CTATGTACTA AATAACTTAG TTTTACAACG ACTGACATTC ATGATATGTC AAATGAGGTT 6780 

GATGACTATT GATTGTACAT AGACCTTTTT ATGTTACGTA TTCATTATAA TTCAAAATAT 684 0 

45 GATTTGATTT CAGCTTTATC TAAATTGTTG CTTAACGCGA CTAATAATTT TAATCTTGCT 6 90 0 

TTTGGACCAT TCAAGCCGTT AGAAAAAATA AAACCTTGTT GTGCGAGTTG GTAACCACCA 6 96 0 

CCATCGTATG CGTAAGTTGG ACTCACAATA CCATTAAAGG AACGTGAAAC TAGCACAATA 702 0 

GGTATATTTA AAGATACTAA TTGTTGAATG CCTTCTAATG CGCTTGGAGG TATGTTGCCT 7080

TGTCCTAACG CTTCAATAAC CATACCATCG ATACCTTCTC GACTATAAAA ACTAAAAATG 714 0 
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TCCAATGCTT GTTGGCGATA TGGCATATGA TGGAATTGCA CACGATCCTT TGTCAATACA 72 60 

CCTAGCGGAC CATGATTTGG ACTTTGAAAT GTGTTTGTAT TAGACGTATG TGTTTTGGTA 7 3 20 

ACATTACGCG CCGTATGAAT TT CATCATT A AaTACAACCA TCACGCCTTT ATGACGGGCC 73 8 0 

TTTTCATCAG AGGCAACGCG AATAGCGGAA ATATAATTAT ATAATCCGTC AGAACCAATT 744 0 

TCATTAGACG AGCGCATTGG CCAAGAAnTA ACAACAGGTT G 74 81 
(2) INFORMATION FOR SEQ ID NO: 237:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6346 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237:

ATGAATTATA CCTTCCACTA GAAGTGTCGG TATTTATGTG CCTGGTGGTA AAGCAAGTTA 6 0 

TCCTTCAACG GTATTAATGA CAGCGACGTT AGCACAAGTA GCAGGTGTGG AAAATATTGT 120 

25 CGTTGTGACA CCACCTCAAC CTAATGGTGT GTCTCAAGAA GTATTAGCAG CATG C TAT AT 180 

TACGCAAGTT GATCAAGTAT TTCAAGTTGG TGGTGCTCAA AGTATTGCTG CGTTAACTTA 24 0 

TGGCACAGAA ACGATACCTA AAGTTGATAA GATTGTAGGT CCAGGCAATC AATTTGTTGC 3 00 

30 ATATGCCAAG AAATATTTAT TTGGACAAGT AGGTATTGAC CAAATAGCTG G AC CAACAG A 36 0 

AATAGCACTT ATTATTGATG AAACAGCAGA TTTAGATGCC ATCGTATATG ATGTTTTTGC 42 0 

ACAAGCAGAA CATGATGAAT TAGCACGTAC ATATGCCATT AGTGAAGATG CGCAAGTCCT 480 

TAAAGATTTA GAATCACGTA TTGCTAAAGC ATTGCCTAAT GTGGACAGAT ACGACATTGT 54 0 

TTCTAAAAGT ATCGCTAATC AACACTACCT TATCCATGCT AGTAATTTTG ATGAAGCATG 600 

CCATGTCATG AATACAATCG CGCCTGAACA TGCGTCGATT CAAACAGTAA ATCCTCAACC 660 

ATATATTGAA AAAGTGAAAT ATGTGGGTGC ATTGTTTATT GGACATTATT CGCCAGAGGT 72 0 

CATAGGAGAT TACGTTGCAG GTCCAAGTCA TGTATTACCT ACAAATAGAA CAGCTAGATT 780 

TACCAATGGG TTATCGGTCA ATGATTTCTT AACACGGAAC ACGGTCATCC ATTTATCAAA 84 0 

AGATACGTTT GAACAAATTG CTGATTCAGC ACAACATATT GCTCATGTTG AAGCATTATA 900 

CAATCACCAG CAGTCTATTT TAATACGTCA GTCTTAGGGG AGTGTAATTG AAATGATTTA 96 0 

50 TATTGATAAA AATGAAAGTC CAGTTACGCC GTTGGATGAA AAAACAATGA CGTCTATTAT 102 0 

TAGTGCAACg CnATATAATT TATATCCTGA TGCAGCATAT GAACAATTCA AGGAAGCTTA 1080 

EP 0 786 519 A2 



GATTCAAAAG TTAATGCTGA TCATGCCAGA AGGTCCGGCA TTAACGCTAA ATCCTGATTT 12 00 

TTTTATGTAT CAAGCATATG CGGCACAAGT AAATCGTGAA ATTGCATTTG TAGATGCAGG 1260 

5 ATCAGATTTA ACGTTTGATT TGGAAACCAT TTTAACGAAA ATCGATGAAG TACAACCATC 132 0 

ATTTTTTATT ATGAGTAATC CACATAACCC TTCAGGCAAG CAATTTGATA CGG CATTTTT 13 80 

AACAGCTATT GCAGATAAGA TGAAAGCATT AAACGGATAC TTTGTCATTG ATGAAGCATA 144 0 

TTTAGATTAT GGTACGGCAT ATGACGTGGA ACTGGCACCA CACATCTTAA GAATGCGTAC 15 00 

ATTATCAAAG GCGTTTGGAA TTGCCGGCTT AAGATTAGGT GTCTTAATTA GTACTGCTGG 156 0 

AACGATAAAG CATATTCAAA AAATAGAACA TCCATATCCA TTAAATGTAT TTACGCTAAA 16 20 

TATTGCGACT TATATTTTTA G A CATAG AG A AGAGACAAGA CAATTTTTAA CGATGCAACG 16 80 

ACAGTTAGCT GAGCAGTTAA AACAAATATT TGATACACAT GTTGCAGATA AAATGTCAGT 174 0 

20 GTTCCCATCA AATGCTAATT TTGTACTTAC TAAAGGCTCA GCAGCGCAAC AATTAGGACA 18 00 

ATACGTATAT GAACAAGGAT TTAAACCTCG CTTTTATGAT GAGCCGGTGA TGAAGGGCTA 186 0 

TGTAAGATAC TCAATTGCAA CAGCATCACA GTTAAAGCAA TTAGAAGAAA TTGTTAAAGA 192 0 

25 ATGGAGTGCA AAATATGATT TATCAAAAAC AACG AAA CAC AG CTG AAACG CAACTAAATA 1980 

TTTCAATATC TGATGATCAG TCACCATCGC ATATTAATAC AGGTGTGGGC TTTTTAAATC 2 04 0 

ATATGTTAAC CTTGTTTACA TTTCATAGCG GTCTGTCATT AAACA1TGAG GCACAAGGTG 2100 

ATATTGACGT AGATGATCAC CACGTAACTG AAG AT AT CGG CATTGTCATT GGCCAATTGT 216 0 

TACTTGAAAT GATTAAAGAT AAAAAGCATT TCGTTCGTTA TGGAACGATG TACATTCCAA 2220 

TGGATGAAAC ATTAGCACGT GTCGTTGTGG ATATAAGTGG GCGCCCATAC CTATCATTCA 2280 

ATGCATCATT AAGTAAAGAA AAAGTTGGTA CGTTTGATAC GGAGTTAGTA GAAGAATTTT 234 0 

TTAGAGCGGT CGTAATCAAT G C AAG A T T AA C AACG CATAT TGATTTAATT CGTGGAGGTA 24 00 

ATACACACCA TGAAATTGAA GCTATATTCA AAGCGTTTTC CCGTGCATTA GGCATAGCGC 24 6 0 

TAACTGCAAC TGATGATCAG CGTGTGCCGT CAT CG AAAGG TGTGATTGAA TGATTGTCAT 2 52 0 

CGTTGATTAT GGATTAGGGA ATATTAGTAA TGTAAAACGC GCTATTGAAC ATTTAGGGTA 258 0 

TGAGGTGGTT GTCTCAAATA CCTCAAAAAT AATCGATCAA GCAGAAACAA TCATATTGCC 2640

CGGTGTCGGC CATTTTAAAG ATGCGATGTC AGAGATAAAA CGATTAAATC TCAATGCAAT 2 70 0 

ATT GG CT AAG AATACTGATA AGAAGATGAT TGGTATTTGT TTAGGCATGC AATTAATGTA 2 76 0 

50 TGAGCATAGT GATGAAGGCG ATGCATCTGG ATTAGGGTTT ATCCCAGGAA ATATTTCGCG 2820 

TATCCAAACA GAATACCCAG TGCCACACTT AGGCTGGAAT AATTTAGTGA GTAAGCACCC 2 8 80 
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AATTGCATAT GCGCAGTATG GGGCTGATAT TCCGGCAATT GTTCAATTTA ACAATTATAT 3 00 0 

TGGTATTCAA TTCCATCCTG AAAAAAGCGG TACATATGGG TTACAAATTT TGCGTCAGGC 3 06 0 

5 AATACAAGGG GGATTTATAA ATGATTGAAT TATGGCCAGC GATTGATTTG ATTGGGTCAA 312 0 

CAAGTGTGAG GTTAACAGAG GGTAAATATG ATAGTGAAGA AAAAATGTCA CGCTCGGCTG 3180 

AAGAAAGTAT TGCTTACTAT AGTCAATTTG AATGTGTGAA TCGTATTCAT ATCGTCGACT 3 24 0 

TGATAGGTGC TAAGGCACAG CATGCCCGAG AGTTTGATTA TATTAAGTCA TTGAGGAGAT 3 3 00 

TAACAACCAA AGATATTGAA GTAGGTGGTG GCATTCGTAC GAAGT CACAA ATCATGGACT 336 0 

ACTTTGCCGC AGGGATTAAT TATTGCATAG TTGGAACGAA AGGTATTCAA GATACTGATT 342 0 

GGCTTAAAGA GATGGCACAT ACATTTCCAG GTCGCATTTA TTTATCTGTT GATGCCTATG 34 8 0 

GAGAAGATAT TAAAGTGAAC GGATGGGAAG AGGACACAGA GTTAAATTTA TTTAGTTTTG 354 0 

20 TGAGACGGTT ATCGGATATA CCTCTTGGCG GCATTATATA TACTGATATT GCTAAAGATG 36 0 0 

GCAAAATGTC CGGACCTAAC TTTGAATTAA CTGGT CAATT AGTAAAGGCA ACGACGATTC 36 6 0 

CTGTCATTGC TTCCGGTGGT ATTAGACATC AGCAAGATAT TCAACGATTA GCGTCGCTAA 372 0 

25 ATGTTCACGC TGCTATTATA GGAAAGGCTG CACATCAAGC ATCTTTTTGG GAGGGGCTAA 3 78 0 

AATGATTAAA AAACGT AT C A TTCCATGTTT AGATGTCAAA GATGGTCGTG TCGTTAAAGG 3 84 0 

GATT CAATTT AAAGGATTAA GGGATATTGG GAATCCTGTT GATTTAGCAA TGTATTACAA 3 90 0 

TGAAGCGGGT GCTGATGAAT TAGTATTTTT AGACATCTCT AAGACGGAAG AGGGTCATAG 3 96 0 

CTTAATGCTA GAAGTGATTG AACAGACAGC GTCACGCTTG TTTATCCCTC TTACTGTAGG 4 02 0 

GGGTGGGATT CAAAGTCTCG ATGATATTAC CCAATTGCTA AATCATGGTG CAGATAAAGT 4 0 80 

ATCATTAAAT TCAAGTGCTT TAAAAAATCC ACAGCTCATT AAACAAGCGA GTGATAAATT 414 0 

CGGTAGACAA TGCATCTGCA TAGCAATTGA TAGCTATTAT GATCCTGAAA GAAAAGCACA 4200 

TTATTGTTGT ACGACTGGTG GTAAAAAAAT GACAAATATT AAAGTATATG ACTGGGTACA 4 2 60 

GCAAGTAGAA CAGTTAGGTG CAGGTGAGCT CCTCGTTACA AGTATGGGAC ATGATGGTAT 4 3 20 

GAAACAAGGC TTTGATATTG AACACCTAGC AAATATTAAG TCTCTTGTAA ATATTCCAAT 4 3 80 

45 CATTGCTTCT GGTGGTGGTG GCAATGCACA ACACTTTGTA GAATTATTTG ATCAGACGGA 444 0 

TGTTTCTGCA GGTTTAGCTG CAAGTATATT ACATGATCGA GAAACGACGG TTCAATCTAT 4 500 

TAAAGAAGTG ATACGGCAAG GGGGTATAGC AGTAAGATGA CCAAATATAA AATTGATTTT 4 5 60 

50 AGCAAAGGTT TAGTGCCAGC AATTTTACAA GATAATCAAA CAAAACAAGT ATTGATGTTG 4620 

GGTTATATGA ACCAAGAAGC TTTTGATAAA ACGATAGAAG ATGGTG TGGT ATGTTTCTAT 46 80 
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AAAGATATTC ATGTAGATTG CGACAATGAC ACTATTTTAA TTGATGTCAT ACCAAATGGA 4 800 

CCAACATGTC ATACAGG CAG TCAAAGTTGT TTCAACACAG AAGTTCCATT TTCAGTGCAA 4 860 

ACATTAGCGC AGACAGTTCA AGATAGTGCC CAATCCAATA ATGAAAAGTC ATATACAAAA 4 920 

TATTTATTAA CAGAAGGTAT AGAAAAGATT ACAAAAAAAT ACGGTGAAGA AGCTTTTGAA 4 980 

GTCGTAATTG AAGCAATTAA AGGTGACAAA AAAGCATTTG TAAGTGAAGT AGCAGATGAA 504 0 

CTTTATCATT TATTTGTCTT GATGCATGCG CTTGGCGTCG ATTTTTCAGA AATTGAGGCG 5100 

GAATTAGCGC GTAGACATCA TAAGCGCAAT AACTTTAAAG GTGAACGACA AAATATCGAA 5160 

CAGTGGTAAA GCAAGTATGG ACTAAGATAT AAGGAAAAGG ATCATGGCTT ATACACTTAC 5220 

AAATATTGTG GAAAACGTGA CATTTTCAAG TTTAAAATAC GACACCAACA TATTTTAACT 52 80 

ATGAATGCTG TGATGGTACT AAAGTTGCGA ACTCGTTATA GATAAGTAGT GGATAATCAC 53 4 0 

AATACGAAAT CAAAAATAAT TATAAAAAGT AAATTGAGCA ACTCAGGAAT AGATGTCACT 54 00 

GTTAAAGATG TCGAAAAGTA TATGAATCGA TATAATGAAG TTATGAAGGG AAAAAATGGC 54 60 

GAAAAAGCTA AAGAGTTATG TTTGTCGTTA CTACCTATTA ATATCATAGT TGTCTTTACA 5520 

TTCTTTGTAT TTATACTATA AATACAAATA TATCTAGCCT GAAATAGAAA TGTCATAGCC 55 8 0 

TATTTAAAAG ACAATCTCCA TTAGAACTAA GAT ATG CATC C CGAAAGTT A GACTAAAAAA 564 0 

CTAACTTTAT GGGATGTATT TTTATGCTAA T CAT CAT AAA TTCGAGATTA AGTTAAAGGT 5700 

AGTTCAAGAG TAATTAAACA ATAAATwAAA AATAGTAGGA TACTTACTTT GAGGGAAGAA 576 0 

AATTAACTGT ATATATTTAG TTTAGGAACA AGTATTACGG TTTATCCTGA TACAATTATT 5 820 

GTGGATGGGA TGATATTTTT AGGTTTAAAA TACGACACCA GCAAACATAA TAACTGTAAT 5 880 

AGCTCATAAA TCTCCCCATA TAGCTAATCT AAAAAAATAA TACATCATTG GAATTAAGCC 5 94 0 

CCAAGCATGT AAATATTAAA AATCAAAAmA GATATmTGTA AAAaAGTTAC AATTtGCATA 6000 

ATTAAATTGT GTCTAATTAT TGACTAATTA AATTTTGCCA AATATAATAT TAATTAATAA 606 0 

TTTGaAATGA TTAGCGTATA CACTTTAAAT TCTCTTTGGA GAATATATTT TTTAAATACA 612 0 

AATGTAAACG CTTTCTCGTC AAATTAAACA ATAGAAAGGA TGGTCATTAT GAGTGCTTGG 6180 

TTAAGTAAAT TATTTGAGTT TATTCCTCGA ATAATTATCA ATTTGTTTAT CTAAAATAAA 624 0 

AAAATAGAGG TGCTGACAAT GATGAAAAGT CAAAATAAGT ATAGTATTCG TAAATTTAGT 6 3 00 

GTAGGTGCAT CTTCCATTTT AATAGCTACA TTACTATTTT TAAGTG 634 6 

(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3775 base pairs 
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(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238:

TGATAATGAT TCTCATTGTC ATACATCACG AAGGAGGCTA ATTAGTCAAT GAATAAAGTA 60 

10 ATTAAAATGC TTGTTGTTAC GCTTGCTTTC CTACTTGTTT TAGCAGGATG TAGTGGGAAT 12 0 

TCAAATAAAC AATCATCTGA TAACAAAGAT AAGGAAACAA CTTCAATTAA ACATGCAATG 180 

GGtACAACTG AAATTAAAGG GAAACCAAAG CGTGTTGTTA CGCTATATCA AGGTGCCACT 24 0 

15 GACGTCGCTG TATCTTTAGG TGTT AAA C CT GTAGGTGCTG TAGAATCATG GACACAAAAA 3 00 

CCGAAATTCG AATACATAAA AAATGATTTA AAAGATACTA AGATTGTAGG TCAAGAACCT 3 60 

GCACCTAACT TAGAGGAAAT CTCTAAATTA AAACCGGACT TAATTGTCGC GTCAAAAGTT 4 20 

AGAAATGAAA AAGTTTACGA TCAATTATCT AAAATCGCAC CAACAGTTTC TACTGATACA 480

GTTTTCAAAT TCAAAGATAC AACTAAGTTA ATGGGGAAAG CTTTAGGGAA AGAAAAAGAA 54 0 

GCTGAAGATT TACTTAAAAA GTACGATGAT AAAGTAGCTG CATTCCAAAA AG ATG CAAAA 6 00 

GCAAAGTATA AAGATGCATG GCCATTGAAA GCTTCAGTTG TTAACTTCCG TGCTGATCAT 6 60 

ACAAGAATTT ATGCTGGTGG ATATGCTGGT GAAATCTTAA ATGATTTAGG ATTCAAACGT 720 

AATAAAGACT TACAAAAACA AGTTGATAAT GGTAAAGATA TTATCCAACT TACATCTAAA 7 80 

GAAAGCATTC CATTAATGAA CGCTGATCAT ATTTTTG TAG T AAAAT CAG A TCCAAATGCG 84 0 

AAAGATGCTG CATTAGTTAA AAAGACTGAA AGCGAATGGA CTTCAAGTAA AGAGTGGAAA 900 

AATTTAGACG CaGTTAAAAA CAACCAAGTA TCTGATGATT TAGATGAAAT CACTTGGAAC 960

TTAGCTGGCG GATATAAATC TTCATTAAAA CTTATTGACG ATTTATATGA AAAGTTAAAT 1020 

ATTOAAAAAC AATCAAAATA ATTAAGGAGT TTTACGATGC TACTTAAACC AAAATACCAA 1080 

40 ATCGTTATTG CTGGTTTATG TCTTGCAATA GTAGCTATCT TAAGTTTAAT GATTGGAAAT 114 0 

ACGCTTGTGT CACCAGGTAC GGTGATACAG GCGTTATTCA ACTTTGATAG TGAAAACGAT 120 0 

TTACATGATG TTGTCACTGG TGCACGGGCG TCGAGAACAA TCATTGCGTT ATTGACTGGT 126 0 

45 GCTGCCCTTG CTGTCTCAGG TTTGTTGATG CAAGCACTTA CACGAAACCC AATAGCCTCA 13 2 0 

CCAGGGCTTT TCGGTGTCAA TGCAGGCGCA GTATTTTTTG TCATTTTTAG TATTACATTT 13 8 0 

ATCCAAATTC AATCTTTTAA AATGATTGTA GTTATTGCAT TTTTGGGGGC TATTGTTGTT 144 0 

ACTGTATTAG TTGTTGCACT AGGTATGTTT AGACAAACAC TATTCTCACC TCACCGTGTC 150 0 

ATTTTGGCAG GTGCTGCGAT TGCGATGCTA TTTACAG C CT TTACTCAAGG CATACTTATT 1560 
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AATATTTGGG ATATCCCATG GATTATTCCG CTTGTATTGA TACTTATTTT AATTGCATTT 16 80 

AGCATGGCTG CACACATCAA CATCTTGATG ACAAGTGACG ACATTGCAAC CGGCCTCGGT 174 0 

5 CAAAACATAA AATTAATCAA ATGGATGATT ATTATGCTCA TCAGTATGTT AGCCGGTATT 18 00 

TCGGTAGCCG TAGCTGGATC AATCGTCTTT GTGGGTCTTA TCGTACCGAA TATTAGCAAA 1860 

CGATTATTAC CACCAAACTA TAAGTATTTA ATTCCTTTTA CTGCATTAGC TGGAGCAATC 192 0 

CTAATGATCA TTTCAGACAT TGTTGCTCGT ATAATAATTA AGCCACTAGA GTTGCCTATC 1980

GGTGTCGTTA CCGCTGTCAT TGGCGCTATT GTCTTAATCT ATATTATGAA GAAAGGACGT 204 0 

CAACGCTTAT GACCGAAAAG ATTAATAAAA AAGACAATTA CCATCTCATC TTCGCGTTAA 2100 

TCTTTTTAGC CATCGTTTCA GTGGTAAGTA TGATGATTGG TTCAAGCTTT ATAC CATTAC 2160 

AACG CGTACT GATGTACTTT ATAAATCCAA ATGACAGTAT GGATCAATTC ACTTTAGAAG 2220 

TATTACGCTT ACCTCGCATT ACACTTGCGA TTTTAGCAGG TGCCGCACTA GGAATGAGTG 2 2 80 

GTTTAATGTT G CAAAATG T A TTAAAAAATC CAATTGCCTC ACCTGATATT ATCGGTATCA 2340 

CAGGTGGTGC TAGCTTAAGT GCTGTTGTCT TTATTGCATT TTTCAGCCAT TTAACAATAC 24 00 

ATTTACTTCC ACTATTTGCA GTATTAGGTG GCGCAGTTGC AATGATGATA CTATTAGTGT 2460

TTCAAACGAA AGGACAAATA CGCCCGACAA CACTCATAAT CATCGGTATT TCGATGCAAA 2 520 

CGTTGTTTAT TGCGCTTGTC CAAGGATTAC TCATTACAAC GAAG CAATTA TCTGCTGCCA 2 5 80 

30 AAGCTTATAC ATGGCTAGTC GGAAGTCTTT ACGGTGCTAC GTTTAAAGAT A CAATCATTT 2 64 0 

TGGGTATGGT TATTTTAGCT GTTGTGCCGT TGTTATTTCT TGTTATACCA AAAATGAAAA 2 700 

TAT CT AT ACT TGATGACCCT GTAGCGATTG GCTTAGGCTT ACATGTACAA CGTATGAAAC 276 0 

35 TAATCCAATT AATCACTTCT ACTATACTCG TATCTATGGC AATCAGTTTA GTAGGTAACA 2 82 0 

TTGG6TTTGT CGGTTTAATC GCACCACATA TCGCGAAAAC AATCGTTCGC GGAAGTTATG 2 8 80 

CTAAAAAGTT ACTAATGTCA GCAATGATTG GTGCCATATC AATTGTTATT GCAGACTTAA 2 94 0 

TTGGGCGTAC CTTATTCTTG CCTAAAGAAG TGCCAGCAGG TGTATTTATT GCTGCTTTTG 3 000 

GTGCCCCATT CTTCATATAC TTATTATTAA CCGTGAAAAA GTTATAACGA TATTATTAAA 3 060 

ACAAAATGAC CTCACAACGA AGTTAGCTAA ATGATTCAGT TAACTAACCG TTGCGAGGTT 3120 

TTTTTATACA TATAGTTGTT GTTATTGTTA ACAAGCGTCG ACTTTCTTAA TT A CAT ATT A 3180 

ATACTTTATA TACAAATAAC ACCGACTCAT ATTCTATAAT ATCAATCAAT ATTCTTCGAT 3 24 0 

50 TTTTCAAATA TCGATAACTA TTTCTTATTT AAATATAGTG TTTGATAATG TCATTTATTC 3 300 

AAAAACACAA ATTTTAATAA AAATATCATA TTATTTTTAA TTGTAAATTA TGGATTATTT 3 360 

ATATAATAAC CATGACAAAA TTAGAGATTT TATAATCATT GAAGCATATA TGtnTCGTTT 34 8 0 

TAAGAAAAAA GTCAAGCCTG AAGTCGATAT GACTATAAAA GAATTTATAT TACTGACTTA 354 0 

5 TTTATTTCAT CAGCAAGAAA ACACACTTCC ATTTAAGAAG ATTGTTTCAG ATTTATGTTA 36 0 0 

TAAACAATCG GATTTAGTAC AGCATATAAA AGTACTTGTG AAACATTCAT ATATTAGTAA 36 6 0 

AGTTCGAAGT AAAATTGATG AGCGTAATAC TTACATTTCA ATATCTGAAG AACAACnAGA 37 2 0 

nAAAATTGCA GAACnTGTTA CATTGTTTGA TCAAATCATT AAACAATTTA ACCTT 3775 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1361 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

AGAAAAATTA GCCTACCTAT GCAAGTTGTT ATTGCATTAG TGTTAGGTGT TGTCGTAGGA 6 0 

CTTTTATTAT ATGGACAAGA AAACGTAGCA AATTACATTA AACCATTTGG TGACGTATTT 120

TTAAATTTAA TTAAAATGAT CGTTATACCA GTTGTATTTT GCTCACTAGC G CTTTCTATT 180 

TCGAACGTTG GGGAATCGAA AACTGTAGGG CGTTATGGCT GGaAAACAAT TTTATACTTT 24 0 

30 GAAATTATTA CAACAATCGC AATAGGTTTA GGGATTATCT TCGGTAACCT ATTTAAACCA 3 00 

GGTGCTGGAT TAGACCCAAC AAAATTACCT AAAGGTGATA TTTCTAAATA TCAATCAACT 360 

GCACATGCAG CAGAAGAATC TACATATGGa AATCATTTTA TTG ATA C CAT TGTACATATT 420 

35 ATTCCGACAA ACTTTTTTGA AGCTTTAAAT AAGGGTGAAT TATTACCTAT TATCTTCTTC 4 80 

GCAGTATTCT TTGGATTAGG ATTAGCTGCT GTAGGTAAAA AAGCAGAACC AGTTAAAGAA 54 0 

TTTTTAAGCG GATCGCTTGA AGCTGTGTTC TGGATGATTA ATAAAATTTT AAAATTAGCA 600 

CCACTTGGAG TGTTTGCATT CATTTGTACT ACAATTATTA CATTTGGTGC ATCCGCATTA 66 0 

TTACCACTAT TAAAATTAGT ATTAGTTGTT GTCTTTGCAA TGGTGTTCTT TGTATTCGCT 720 

ATACTAGGAC TAGTTGCATG GATGTGTGGT ATTAATATCA TGAATATTAT TAGAATCTTG 780 

AAAAGTGAAT TGCTTTTAGC ATTTTCTACA TCAAGTTCGG AAGCTGTACT TCCTGTAATG 84 0 

ATGAAGAAAA TGGAAAACTT CGGTTCTCCA AAAGAAATTA CTTCTTTTGT TATACCAATT 900 

50 GGTTATACGT TTAACTTAGA TGGATCAGCA CTTTATCAAT CTATTGCAGC ATTATTCGTT 960 

GCACAGATGT ATGGAATGCA CTTAACATTA TCAGAGCAAA TTGTGTTGAT GTTAACATTA 1020 

ACATTAGGTG CCATGGGCTT ACCGGCACAA GGTTTAGCAT TAATTATTGG TGTTGACCGT 114 0 

ATCTTAGATA TGGTACGTAC ATGTGTAAAC GTTATTGGTA ATGCATTATC AACAATCGTT 12 0 0 

ATAGCTAAAT GGGAAAACGT ATATGACAAA GCAAAAGGTC AAGAATATTT AAAATCAATT 126 0 

TAAAAAATAC TATCTGACAT TTAArGnCCC TTACAACCTT TGGTTgTnAG GGCTnTTTTA 132 0 

TGTCATGCGT CTTAAAGCCA GGCCGTATAn CGGTAAGCGT A 1361 



(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1489 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:

TCAATATGTA AATCGGCGAT TGTCCCTATC tTCATACCCT TACCTTCTTT ACTCAAATTA 6 0 

AAATGTTATT T ATT AT G CCA TaGCTTATCT AATATATATA GTTAACTGCT TCATTTAGAT 12 0 

25 GATTATTTTA TATTTTTGCA TAAAAACTTA TATCTTTTCA AAAAAATCGA TAAGTTTTAG 18 0 

TTATCATACC CTTACCTATC AAATGTTTTT TCTTATATTT AAAAAAATAA TTGCTTTATT 24 0 

AAATGGATTT CTTTAGTATT TATAATTAAG AAAACGCTTA CACACAACTT TTTTATTTGC 3 00 

30 TTTATCCTGA GGAGGAAAAT TATGGCAAGA AAATTG CAT A G AG AG TTG AA TAACAGACAC 36 0 

ATCCAATTAA TAGCAATTGG GGGCGCAATT GGAACTGGGT TATTCCTAGG ATCAGGTCAA 42 0 

ACAAT AT CTT TAACTGGTCC AT CACTGTTA TTCACATACA TGATTATTGG GGTTGTACTA 48 0 

35 TTCGCTTTTA TGCGCGCATT AGGCGAATTG TTGTTGAGCA ATACAAGATT TAATTCATTT 54 0 

GTTGAXATTG CAAATGAATA TTTAGGCCCT TTTGGTGGCT TTGTCATTGG CTGGACTTAC 60 0 

TGGTTATGTT GGATTGTATC AAGTATGTCA GACCTAACTG CGATGGGACA AT ACTTTG CA 66 0 

TTTTGGTATC CACAAGTCCC AAATTGGATT ACCGTGCTAT TTATTGTTTT AATCTTGATT 72 0 

AGCTTCAACT TATTAGGTGC CAGATTATTT GGTGAACTGG AGTTTTGGTT CTCGATTATT 780 

AAAGTTGTCA CAATTATTGC GATGGTTATC GTTGGTCTTG TATTAATCTT TTTCTCATTT 84 0 

AAAACACATT ATGGACATGC ATCATTCACA AACTTAATCA GTCACGGTGG CATGTTCCCT 900 

GGTGGAaCAT TTGGTTTCTT AATGTCATTC CAAATTGCTG TATATTCATT CATTGGTATT 96 0 

GAACTTATAG GTGTAaCTGC TGGTGAAaCG AAAGATCCTG AAAAAACCTT ACCGAAAGCA 1020

ATTAATAATG TACCTATCCG TATTTTATTA TTCT AT AT CG GTGGTCTATT AGTAATTATG 1080 
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TTAATCGGCG TACCATTTGC AGCAGGTGTC GTTAACTTTG TCGTGCTAAC TGCCGCGGCC 12 00 

TCTGCTACAA ATAGTGGTAT CTATTCGAAT AGTCGTATCT TATTCGGACT GTCACAACAA 12 60 

GGGTTAGGTC CTAAAGTTTT AAATAAAACG AATAGTCATG GCGTGCCTTA TTTATCAATG 1320 

TTAGTTTCAT CAATTGCATT ACTTATAGCA GCCTTGTTAA ACTACATTTT CCCTAATGCA 13 80 

ATTCAACTAT TCATATACGT TACAACGTTA tCAACTGTGT TGTTTTTAGT TGTtTGGGCA 14 40 

ATGATnATTG TCGCTTATCn AATGTATTTG GAAAAAGCAT CCTGAGGCA 14 89 

(2) INFORMATION FOR SEQ ID NO: 241:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5000 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

TTTTCCATCA TrATcACCAT TTAAGACAAT AGCTGTATGA CCATATCCTC CACCAAATCT 60 

25 TCCACTAAAA ACCACTAAGT CCCCAGGTTC AGCTTTGAAA GTTGGTGTGT TGTGATAAAT 12 0 

TTTAGCTTCA CTATTAAAAT TATTTGCATA TGGTATATCT TTAGCTCCAT ATCCTTTtAA 180 

TCCAkGACCA TAAAGAkGAT TCCAATATAC ATTAACTAAA TCGAAACATT GCCATCCATA 24 0 

30 ACTACCGTCG AAATCCCATC CTCTATTCTC TAATGTTTTC AAATAATTTA ATGTACTGCT 300 

ATTACTATTA CTTTTATTAT TTGAAGACAC TGTTTTTGGT TTTGGTTCTA CTAATGGTGT 360 

CATTGGCACT TTTAATTTTT G A C CAATAAA TATTAAATTA GGATTTGCTA TATTATTTGT 42 0 

35 ATTTTGAATA TTTGAAACTG TAGTTTTGTA TTTTAATGCT ATAGCACTAA GTGTGTCTCC 4 80 

TTT'nSTACA GTATAGATTT GTGTTTTTGG AGCTTCTCTA AAACTGTAGT AACCAAAGTA 54 0 

ATTATTAGTA ACTTTATTCG TTTTCTTCTG ATTAGAATTT TGAGCTTCCA AGTTTG CAAT 6 00 

TTTAATTTCT TTAGTAAGTT CATTGTTATT AATAACTAGA TTGTTACCTT GGCTTGAGTT 660 

TTTCGGAGTA TTTGAAATCT TTATATCTTG ATTAATTTCA TTTCCGTTTG AAATTGCTGA 72 0 

TTTGTTGTCT AACTTTAAAC TTGTGTCCGA TGTTTTAACA GCACCTTCAT TTTTTATTTT 7 80 

GTCTTTTGTC GTATTTTTAT TAGCATTTAA CTCTGATTTC GCGAATACAT TTTGCTCATA 84 0 

CCCTCTTGTA AAATCTTTAG ATTTATCAAT TTCATCTGCA TATGCTTTGT TCGACATACC 900 

CAATGCCAAA AACATACCTA TTGAAATTGA CAAAATTCCA ATACTAACTT TTCTAATTGA 960

ATAGCGTACT TTACTTTGTT GTTTATTCAT GATGAAACAC TCCTTACAAT AATATACAAA 102 0 
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GTAGGAGGTA AAATAATTAA CTTGTCTTTC CAAAATATGA AAAGTGTACT AAAaTTCATC 114 0 

GCACGACAAA TAGCCCATTT CCGATACTTT TATAAAGTAT GGAaTGGGCT ATAGCCATTT 12 00 

5 ATATCATCTT TTAACTTTAT TTATTAACAG TTAATAATGA TTCATAAATA CCTGCTTCTT 126 0 

TAGCAGCTTC AATTAATGTT GAACCAATTT CTGAAGGTGT TGCCGCTGTT TTCACACCAC 132 0 

AACTATTTAA TGTTTTAATT TTCTCTTCAG CAGTACCTTT ACCACCTGAA ATGATTGCAC 1380 

CAGCATGTCC CATACGTTTT CCAGGAGGTG CTGTTTGTCC ACCGATAAAG CCTACAACTG 144 0 

GTTTTGTCAT ATTCGCTTTA ATCCATTCAG CTGCTTCTTC TTCAGCCGTA CCACCGATTT 1500 

CACCAATCAT AACAACTGCT TTCGTTTCGT CATCTTCATT GAATGCTTTT AAAACATCAA 156 0 

TAAAGTTTGT TCCGTTGACT GGGTCTCCAC CAATACCAAC AG CTGTAGTT T G AC CAAT AC 162 0 

CTTCTTCAGT CAATTGGTGC ACTGCTTCAT ATGTTAATGT ACCTGAACGA GATACTACAC 1680 

CAACATGACC TTTTTTGTGA ATATAGCCAG GCATAATACC AATTTTACAT TCATCTGCTG 174 0 

TAATCACACC TGGACAGTTC GGACCAACTA AACGTGTTTT TCTACCTTGT AAGTAGCGTT 18 00 

TAACTTTAAC CATGTCTAAT ACAGGAATAT GTTCAGTGAT ACAAATAACC ATATCTAAGT 186 0 

25 CTGCATCAGC TGCTTCTAAA ATTGAGTCTG CAGCAAATGG TGCTGGAACG TAAATGACTG 192 0 

AAACCGTTGC CCCAGTTTCA TTTTTAGCTT CTTCAACAGT GTTGAAAACA GGAACGCCTT 1980 

CAACAACTTG ACCACCTTTA CCAGGCGTCA CACCTGCTAC TATTTTCGTA CCATAATCAA 2 04 0 

30 GCATTTGTTT TGTATGGAAA AGGGCAGTAG AC C CTGTAAT ACCTTGTACC ATTACTTTAG 2100 

TATTCTTATC TATAAATACA CTCATCTTAG TGCTCCCATC CTTTCCTTAT GCTTCTTTGA 2160 

CTAGTTTAAC AATTTTTTGT GCACCTTCAG CCATTGTTGC TGCTGGTTCA ATTGCTAATC 2220 

CTGAGTCTTT TAAGATTTTT TTACCTAACT CAACATTTGT ACCTTCTAGG CGTACAACTA 2280

GTGGTAAAGT TAAATCTACT TCTTTTACAG CTTCAACGAT ACCTTCTGCG ATAACATCAC 2 34 0 

ATTTCATAAT GCCACCGAAA ATGTTTACAA AAATACCTTT AACATTTTCA TCACCTAAAA 24 00 

TGATTTTAAA TGCTTCAGTT ACTTTTTCTC TAGTAGCGCT TCCGCCTGCA TCTAAGAAAT 2460 

TGGCTGGGTT TCCACCGAAA TGATTAATCG TATCCATTGT TGCCATGGCT AAACCTGCAC 2 520 

CATTAACCAT ACATCCGATG TCACCATCTA ATGCAATGTA TGATAAATCA TGTTTAGACG 2 580 

CTTCAATCTC TTTCGGATCT TCTTCTTCTA AATCACGTAA TTCTACAACA TCTTTATGTC 2 64 0 

TGAATAATGC ATT AT CATC A AAATTAATTT TAGCATCTAA TGCCAATACA TCACCATCAG 2 700 

CTGTTGTAAC TAATGGGTTG ATTTCTACGA TTGAACAATC TTTTTCAATG AATACATTAT 2760

AAAGTG CTAA TAAGAATTTA GCAGCTTTGT TAACAGATTC TTTAGGAATA TTAATATTAA 2 82 0 
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AGATCTTTTC AGG AG TCTTC GCAGCAACTT CTTCAATCTC AGTGCCCCCT TCTTCAGACG 2 94 0 

CCATCAATGT TACTTGGTCA GTCGCACGAT CAATAACGAA TCCAACGTAA TATTCTTTTT 3000 

5 GAATAGCACA ACCTTCTTCG ATATATAAAC GCTTAATTTC TTTACCTTCT GGACCAGTTT 3 06 0 

GATGTGTCAC CAAAGTTTTC CCTAATAATT CTTTTGCATA TGTTTCTACC TCAGATAAAG 3120 

ATTTAGCAAT TTTTACTCCG CCTGCTTTAC CTCTACCTCC AGCATGAATT TGTGCTTTTA 3180 

CAACATAAAC ATCAGAATTT AATTCTTTTG CTTTCTCCAC CG CTTCTTCA GCAGTAAATG 324 0 

CTACTCGTCC TTCTGGAACT GCAACGCCCA TTGAACGAAA TATTTCTTTA CCTTGATACT 33 00 

CGTGGATATT CATCTTCCAT CCTCCTGTTA CTTAGGTTAA GTTCCCTTAC AATTATAAAA 33 60 

AATGTAAGCG CTATTGTAAA CTTAAATGCT ACTTTTTTAT CATTTAATTG AATTTTA CG A 3420 

TTTACAGTAA CGATTTTATA GGTTCAAAGC TTTTTCTATG CTCTTTCATA ATGCCAATAT 34 80 

CATCGATTGC TAGTAAATGT TGTTTGGTAC CGTAACCCGC GTTTTTTTCA AAA C CAT ATT 3 54 0 

CAGGATAATC TTTAGATAAC TGTGTCATAT AATCATCACG AAAAACCTTT GCCATGATAC 3 600 

TTGCAGCTGC AATGGACACA CTTCTTGCAT CACCCTTGAT TAAAGATACT TG AGG CAG TG 3 66 0 

25 CATTATCAAG CGTCATCGCG TCTATCAATA AATGCGTTGG TTGTACTGAT AATCCATCAA 3 720 

TAGCTCGCTG CATGGCGATT TGAGTAGCTT TATAAATATT AAATT CAT CT ATTTCTTnCA 3 780 

GtGTCGCGAT CCCATATGCA AAAGCAGTAA CTTCATTTTT TAGTGCTTCA TTTAATTCTA 3 84 0 

30 GACGTTTCGT AACAGGTACT TTTTTCGAGT CATCAAGGCC CAAATAATTG TGATTTGAAT 3 90 0 

TTAAAATTGT TGCGCATGCA ACGACTGGAC CTGCTAAAGG TCCTCTTCCA ACTTCATCAA 3 960 

TCCCACAAAT AATAGCATTA GGATGCTCTT TTAATATTTC ATTTTCAAAG TAAG TCATTT 4 02 0 

35 CAACATACTT TTCTTTTAAA GCTTGTTCTT TTTCTAACGC TTTTCTGCGC CTAGCTATGG 4 080 

CATTTTGAAC ACCTTTTCGC TCATCTAAAA AGCATTCATG ATTTTCTAAT TCTTCTATTG 414 0 

TATTAACCGC ATTAATCAAC TG C G TAACTT CTTTAATTGT TAGCGTCATT TGCTAATTCC 42 00 

TCAGTCATAT CTTTAAAAAT ATCAAAACAA TAATTTCCTA TTTTAGCATT TCGAATATCA 4260 

TAAATAATCA GTTCAATGAC TGCTTCGTAA TCAATTTCAT TACCACGTCG AATTAAGCCc 4320 

ACGTTTTTTC CCTATCGCAT CAAACCACGC TATGATTTCT GCAtCTTCAG GAACTTCAAT 4 3 80 

ATTATAATGT GACTTTAATC GCGCTAAATC ATTTTGAATT AAAAAGTTTA ATCCATAGAT 4 44 0 

GGCAACTTCA TCTAAGTGCA CAATACTATC TTTTATCGCA CCAGTTAAAC TCAACTTCTT 4 500 

ACCGACTTCT TCATCTTCAA ATTTAGGCCA AAGTATCCCT GGTGTGTCTA ATAGTTGTAA 4560

TGCATTACCA ACTTTAATCC ATTGTTGTTG TTTGGTCACA CCTGGTTTAT TACCAGTCTG 4 62 0 
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35 




AACGATCATT GCTCTTATCG CTCTAGGTTT AAGTCCTTTC GCTTTTTCGC GTTCAAATTT 4 740 

TTCAGCAGTC GCCTTAATTG CTGCAGCTTC CACTTTCTTT AAATTTTTAC CGTGCTTAGC 4 8 00 

ATCCACTGAT ACAGGATAgT AtCCTTTATC AATAAAAAAT tGTTCCCATT TTGACATCTC 4 86 0 

ATTTAAATTA GACATATCTT TTTTATTTAA TATAACAACA CGTGGTTTTT GGTTAATAAC 4 92 0 

TTCATCTATC ATAGGGTTTC TTGAACTATA TGGAATTCTT GCATCTACTA GTTCAAACAC 4 98 0 

TACATCTACT TTTTTTAATT 5000 

(2) INFORMATION FOR SEQ ID NO: 242:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1700 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:

AACCCGnAGA CGAAACTnCA TAGTTGCCTG TTATGTTGAT TAGTGCGTTT G TAG CT ATTT 6 0 

25 nAAATCAAAC ATTATTAAAT ACAGCGTTaC CTAGTATAAT GAGAGAATTA AATATCAATG 120 

AAAGTACATC GCAATGGCTA GTTACTGGGT TTATGCTTGT TAATGGCGTC ATGATACCTC 180 

TGACGGCATA TCTAATGGAT AGAATTAAAA CTAGACCTTT ATACTTAGCG GCGATGGGGA 24 0 

30 CA T IT ATT AGGTTCTATT GTTGCAGCCT TAGCTCCGAA TTTTGGAGTT TTAATGTTAG 3 00 

CTCGTGTAAT TCAAGCGATG GGTGCAGGCG TACTTATGCC CTTAATGCAA TTTACGTTAT 360 

TTACATTGTT CAGTAAAGAA CAT CG AGGTy TtGCAATGGG ACTAGCAGGT TTAGTAATTC 4 20 

AATTTGCACC AGCAATAGGA CCTACAGTTA CAGGATTAAT TATTGATCAA GCGAGTTGGC 4 80 

GAGTTCCATT TATTATAATT GTAGGAATTG CTATACTTGC CTTTGTTTTC GGTTTGGTTT 54 0 

CAATCTCGAG TTACAATGAA GTGAAATATA CGAAATTAGA TAAGCGTTCA GTAATGTATT 6 00 

CAACTATTGG GTTCGGGTTA ATGCTATACG CATTTAGTAG TGCAGGAGAT TTAGGATTTA 660 

CAAGTCCAAT AGTAATAGGT GCGTTGATAT TAAGTATGGT TATTATCTAT TTATTTATAC 720 

GTAGACAATT TAATATTACT AATGCACTTT TAAATTTAAG GGTTTTTAAA AATAGAACAT 780 

TTGCATTATG TACGATTAGT TCAATGATTA TAATGATGTC AATGGTTGGA CCTGCGCTGC 84 0 

TTATACCGCT ATATGTTCAA AACAGTTTAT CTTTATCTGC CTTGTTATCA GGACTTGTTA 900 

TCATGCCTGG TGCAATAATA AATGGTATTA TGTCAGTTTT TACAGGTAAA TTTTATGATA 96 0 

AGTATGGTCC TAGACCATTG ATTTATACTG GTTTTACAAT TTTAACAATT ACTACAATTA 102 0 
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GAATGTTTTC AGTTTCTTTA CTCATGATGC CGATAAATAC TACAGGAATT AATTCTTTGA 114 0 

GAAATGAAGA AATCTCACAT GGCACGGCTA TTATGAACTT TGGTCGTGTA ATGGCTGGTT 1200 

CACTAGGCAC AG CTTTAATG GTTACATTAA TGAGTTTTGG TGCAAAAATA TTTTTATCTA 1260 

CATCGCCATC GCATTTAACT GCAACTGAAA TTAAACAGCA ATCCATTGCT ATAGGGGTGG 132 0 

ATATCTCATT TGCTTTTGTA GCTGTGCTTG TTATGGCAGC TTATGTGATA GCACTTTTTA 138 0 

TAAGAGAACC TAAAGAAATA GAAAGTAATA GAAGGAAATT TTAAAATAAT TATAGTAGTT 144 0 

GGTCTATTTA AAATAATAGG CTAACTGCTT TTTTTATTTA ATAAAAAGTT TTATACTTTT 1500 

AGTGATAGAC TAAGCAAAAA TTGTTATTTG CTATGATGTA GATGTCTTAA AATGATTAAG 1560 

GGGGATTTGC TTTGTTAACG GTAGATCAAG TGAAAGAATT GGTAGGAGAA ATTAAAGATC 1620 

CTATTATAGA TGTGCCTTTA AAAGAAACAG AAGGTATTGT TGAnGTTTCT ATTAAGGGAG 1680 

20 AAnAAGAACA TGTGAGTGTT 1700 

(2) INFORMATION FOR SEQ ID NO: 243:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10146 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:

TGCATCAACT TTCAAACAAT AAATCATCAC AATAAC CACA CCTAATTCAA CACTTTTCAA 6 0 

ACATAAGTAT TGACACATTG AGCAAAATGA TTTTTAATTG TAACTAATAC AGTTACAATT 12 0 

ATGAGGTGAG AAACATTGAA TTTAGAATTT AACATTG CCG TGCATGTATT AGCTTTTTTA 180 

ACT AAG CATC aTTCAGAAAA ATTCAATAGT AGTT CATTAG CAGAaTTAaC TTGTTTAAAT 24 0 

CCTGTTCAAT TACGACGCGT GACGACTCAA CTTGTCGATT TAAAAATGAT TGACACAATA 3 00 

CGAGGTAAAG ATGGCGGTTA TTTAGCAAAT GATCAAAGTG CTGATGTCTC TCTAGCAACA 3 60 

TTATaTAAAC ATTTTGTCTT AGAGAAAGAA CACCACACAC GTCTATTTAC TGG CGACGAA 420 

GGCAGTCACT GTCAAATTGC TCGTAATATT GCAACTACCA TGTCACATTA TCAGCAAGAC 4 80 

GAACAGAATA TCATTATTAA TTTTTATAAT GGAAAAACAA TCAAAGATGT CATTGAAGAC 54 0 

ATTCAAAAGG AGGATTTATG TCATGAAAAC ATATGATTTA ATTGTAATAG GATTTGGGAA 600 

50 AGCTGGTAAA ACTTTAGCGA AATACGCTGC ATCAACAGGT CAACACGTCG CAGTTATCGA 66 0 

ACAATCTC CG AAAATGTATG GAGGCACTTG TATAAACATA GGATGTATAC CTTCGAAGAC 72 0 
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TGTTGTCAAT GCGCTAAACA ATAAAAATTA C CACTT ATT A GCAGACGATA ACAACATTGA 84 0 

TGTACTGGAT TTTAAAGCGC AGTTTAAATC TAATACTGAA GTGAATTTAT TAGATCAACA 900 

5 TGACGATATC GTCGATAGTA TTACTGCAC C TCATATCATT ATTAATACCG GTGCTACCTC 96 0 

TGTCATTCCT AACATTAAAG GCCTTGATCA AGCTAAACAC GTCTTCGATT CGACAGG TTT 102 0 

ATTAAACATT AGCTATCAAC CTAAGCACCT CGTTATTGTA GGTGGCGGTT ATATCGCCTT 10 8 0 

AGAATTTGCT TCAATGTTTG CGAATTTAGG TAGTAAGGTA ACAGTATTAG AACGTGGCGA 114 0 

ATCATTTATG CCACGCGAAG ATCAAGATGT CGTTGCATAT GGTATTACTG ACTTAGAAAA 1200 

TAAAGGCATT GCATTGCATA CAAATGTTGA AACGACTGAA TTGTCATCTG ACAATCATCA 1260 

TACAACAGTC CATACCAACG TTGGTAACTT TGAGGCTGAT GCAGTACTTT TGGCTATCGG 1320 

GCGCAAACCG AATACGGATT TAGCTTTAGA AAATACTGAT ATCGAATTAG GCGACAGAGG 1380 

20 CGAAATTAAA GTCAATGCTC ATCTTCAAAC AACTGTGCCG CATATTTATG CTGCAGGTGA 144 0 

TGTTAAAGGC GGACTTCAAT TTACGTATAT ATCTTTAGAT GATTATCGAA TTATCAAATC 1500 

AGCGTTATAT GGTAATCAGT CACGTACGAC TGACAATAGA GGCAGCGTGC CTTATACAGT 156 0 

25 ATTTATAGAT CCACCATTAT CACGTGTTGG ATTAACTAGT AAAGAAGCTG CCGCTCAACA 16 20 

TTATGATTAC ACTGAACATC AACTTTTAGT AAGTGCTATA CCTCGTCATA AAATTAACAA 1680 

TGATCCAAGA GGTTTATTTA AAGTAGTCAT TAATAATGAA AATAATATGA TTTTAGGTGC 174 0 

30 TACATTATAT GGTAAGCAAT CTGAAGAATT AATTAATATA ATTAAACTTG CGATTGATCA 1800 

AAACATTCCA TATACCGTAT TACGAGATAA TATTTATACG CATCCTACGA TGGCCGAATC 186 0 

ATTTAATGAT TTATTTAATT TCTAGACAAA ACATAAAAAC CTGGTGGCAC GCATTGAATG 192 0 

ATGCTGCCAT CAGGCTTTAT TGTTGTGCTT TTCGCTTTTC TAATTTTTCT TTAAGCTTTC 1980 

TATCTTGTTC TTCTTTACGA CGTTTACGTT CTTCATGTCG TTTTCTTAAA CGCTCTTCTT 204 0 

CTTCAGGATC ACGTGGTTTC TTTAATTGTT GAGAAACTTT TTCGATTAAT TCTTCTTCAG 2100 

TAAGCGCagc CAGTGGGCGG TTATTAACAA AAGTGAATGT TTTTCGGCGT CCAGGTCCAC 216 0 

AATAAGATTG ACAACCTATC ACGATTT CAG CATCGGGATC TAATTTTTCC AACTTCTTTT 2220 

GTAACGTTCT TATATTGACT GCCTGACATT CATCACAAAT AAGGAATGTA TTTTT CAT AT 2280 

TGCTACCCAC CTTTCTTTAT CATATCTATA TCGTCGATTT CATTAATTTT TTCGTTAACT 234 0 

CTATCTATTT TACTCTTTTT AATAT TTTTT T CAAGAT AC G TAACACGGCT GaCAATAAAA 24 0 0 

50 AATGG AG CAT TTATCTTCTA ATTAAATTAG ATGaTTGCTC CCCTATCAAA TCATTTATTG 24 60 

CCCATGATAA ATATTAAATT TT AATGG TTT AATACCATGT TTTGTCCATT CATCATAAAT 2 52 0 
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TGATGTTTTA GATGCGCCGT GATACTTTTC AGCAATATCA CACAAATATT TTAGCTTTTC 264 0 

AGTTTCTATA TCAACTGTAG CTTCTTTATC CATACGTTGA ATAATTGTAC GATTCTGACG 270 0 

5 CACCATCTTT TGCACACCTT TAATGTTATT TGTTTTAAAA GCATGAATAA GTTTTTCAAC 27 6 0 

ACAACGATGT GAATCTTCTA AGAAGTCACC GTAAAATGAA GGATCTGATT TCAAACGTTT 2820 

CACTTCGCTA ACAAAGTGTG GTGATGACGC CGGTGAGCCA GTCCAACCGA TAAGTACTTC 28 80 

CATATTTTCA GGTGCTTGTA ATGGTTCGAT GTGCAATCCA GGCCAGTTTT TGATTAAAAC 294 0 

TTCTTCAACC GTAGTATCTT CAATTTGATG CTTAACCCAT TCATGATCAA AAGTACTATA 3000 

CGCTAGCCAT CCACTATATA CACTCACAGC AATATCTCCG CATGAACTTA AACTTTGTAA 3060 

CTTCATATTT GCAATCACTG CTAGTTTATA AATGTATAAA TTAGATAACT TCATATCATA 312 0 

AAATTCATTT AATACTTTTA TAACTGACAC AAGTACTGCT GCACTTGAAC CTAATCCATA 3180 

2Q TTTATGACCA TTTGAATCAT CTAAATTACT ATCAATAGTC AGATGAAAAT GCTTCATCGC 324 0 

TATATCGCAA CTTTTCGCGT ATTGTTCAAA TATTTCAATA GCTGTGaCCA CATAATTTAA 3 3 00 

TTGTTTTGCT GcATGTGGAT CTGAAATGAC AATACTATCT TCATCTCTAC TAAATGTAAC 3360 

25 TGGGTTATGA TGTAATGCTT TTGAATGAAT GGTACCTTTA TATTGGTCTG CTTCTTCAAT 3420 

AG TAG CAGTT ACAAAACGAT CTAACGCAAT AAGTACAGAT TTATATCCTG GTTCTGTTAC 34 80 

AGCATATTCT CCAGCAATAT AAAGTTTTCC GGGTGCTTTG ACCTGAATCA TTTTATCTCT 354 0 

30 TCCTTACTCA ATTATTTCAA TTCCTGTGGC AATAATGTCA CTATCAATAA TTTGGTTATT 36 0 0 

ATCAAACTGT GTTAATAATT TATCTATAAT CTGTTGCTTG TTTTTCTTTT CTACAAG TAT 366 0 

TTTCACATTA GGTCCCGCAT CCATTGTAAA ATAACACGGA TACCCCGCTT CTCGG CATTC 3720 

GTGAACAAGC GCCATGACAT CATAACTTTC TTGCACAAGA TATGTGAACG GCGGTGTTGA 3780

TCCTAGATTC GTGGCATGCA TACGCAAACC ATTTTCTTCA ATTACTTCAC CAAGGCGTTT 3 84 0 

AAAATCTTTG TCTTGAATCG CTGCTTTTGC TTCAG CTAAA TCTTCATCAA TATGATCTAA 3 90 0 

CCAATATTGA TAAAACCTTG ATGTGTTTCG TGTCAATGAC ATACCATATC GACTAGGTAC 3 96 0 

CTTTTTAGAA TGTTGATTAA TCACAACAAA TATCATGGCA AGGTCATCTT CAAAATGATT 4 020 

CGATTCAAGT GGAACGGCAT ATGACGTCTC ATCACTATAC CCTTTTTCCC ATTCTGCAAA 4080

TCCACCATAA ATACTACGCG ACGCAGAACC CGAACCAATT CGCGCCAATC TCGATAAATC 414 0 

CTTATCTGAC AGCTGCATGT CTAGCGCTTG ATTACAAGCT GCTGCTAAAG CTGCATATGC 4 200 

50 GCTTGCCGAT GAAGCCAACC CTGCTGCTGT TGGTACAAAA TTGTCGCTTT CAATTTCTGC 4260 

ATACCAATCG ATGCCAGCTC TATTTCTGAC AA TAT CC AT A TATTTTGAAA TTTTCTCTAA 4 3 20 
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AAAAGTGACT TTCGTTTCAG TGTAAAATTT TTCTAATGTA ACAGATATGC TATTATT CAT 44 4 0 

TGGAATGATT AGTGCTTCAT CTTTTTTACC CCAATATTTT ATAAGTG CAA TATTCGTATG 4 500 

TGCACGTGCT TTGCCACTTT TAATCAACGG ATTAACCTCC TAAATTCTCA ATCCAAGTAT 4 560 

GTGCTGCACG AGCTTTTTCT ACAGCTTTTA CAATATTTTT CGCTGTTGGT AAATCTTTGG 4620 

CAAGCAATAA CATACTTCCA CCACGACCAG CGCCAGTAAG TTTTCCAGCA ATCGCACCAT 4 6 80 

TTTCTTTACC AATTTTCATT AATTGTTCTA TTTTATCATG ACTAACTGTC AACGCCTTTA 4 74 0 

AATCCGCATG ACATTCATTA AAAATATCCG CTAAGGCTTC AAAGTTATGA TGTTCAATCA 4 80 0 

?5 CATCACTCGC ACGTAAAACT AACTTACCGA TATGTTTTAC ATGTGACATG TAC TGAGGGT 4 860 

CCTCACAAAG TTTATGAACA TCTTCTACTG CTTGTCTTGT TGAACCTTTC ACACCAGTAT 4 920 

CTATAACAAC CATATAGCCG TCTAAACTTA A CG TTTTCAA CGTTTCAGCA TGACCTTTTT 49 80 

20 GGAACCAAAC TGGTTTGCCT GATACAATCG TTTGCGTATC AATACCACTT GGTTTACCAT 504 0 

GTGCAATTTG CTCTGCCCAA TTAGCCTTTT CAATGAGTTC TTCTTTCGTT AATGATTTCC 5100 

CTAAAAAATC ATAACTTGCA CGAACAAAAG CAACCGCGAC AGCTGCACTC GATCCTAATC 516 0 

25 CACGTGATGG TGGTAAATTC GTTTGGATCG TTACTGCTAG CGGCTCTGTA ATATTATTTA 522 0 

ATTCTACAAA ACGGTTCACC AAAGACTTAA GATGGTCAGG CG CAT CAT AT AACATACCAT 528 0 

CGTAAACATC GCTTTTAATA GACGAATAGT TCCCGCTCTC TAAGGCTTCT ATTAAAACTT 534 0 

TGATTTTACC TGCGTTAAAC GGTACTGCAA TAGCAGGCTC TCCAAATGTA ACAGCATGTT 54 00 

CTCCTATTAA AATAATCTTA CCTGTCGATT CCCCATATCC TTTTCTTGTC ATGTCAATAT 54 6 0 

CACCTTTTAT ATTTATCCTA TACTTGATTC ATTATTTTTA TTTATTAGTA AAAGACATCA 5 52 0 

TATTCTAAGT TGCAtngCAT TCGCGTTAAA TTTCATTGCA GTCTTTATCT CACATTATTC 558 0 

ATATTATGTA TAATCTTTAT TTTGAATTTA T ATTTG A CTT AACTTGATTA GTATAAAACT 564 0 

AACTTTCGTT TACTTCAAAG TTTAAATCTT ATCGAGTGAT ATTTCAGATT CTTTATCTTT 570 0 

TTATAAAATA GCCCTACAAT TTATAATTTT CCACCCTAAC TATAATACTA CAAATAATAA 576 0 

TTGGAATATA TAGATTTACT ACTAAAGTAT TAGAACATTT CAATAGAAGG TCGTTTCTTT 5820 

46 CATAGTCATA CGCATTATAT ATACCCTATT CTCAATCTAT TTAATACGTA AAACATGAAA 5880 

TTTTCTTATT AAATTTATTA TTTCCATCAT ATCATTACTT TTAATTTAAT GATGTTCAAT 594 0 

TTAAATATTA GGTCAATAAC ATATTTATGC TTTTTATGGA TACTTTCAAA AATAACAGCC 6 000 

50 CCAAACGATA ACTTGAAAGG GGCTGTTAAA TATTTAACTA TTGCATTTGA TCTATCATTT 6060 

TCTTGTTTCT TTCAATCATT TTATCAAAAT ACGTATCGTA TCTTTGCCAT TCTTCTTGAG 6120 
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TTACATCTTG AACAGTAATC GTTTTGTTAA GCAATGTCTC TAATGAGGCC ATACAAGATG 624 0 

GTTCAATTTC AGGATATTTA AATTTAGTCA CTTCACCTTT TAAAGCATGT TCATAAAATG 6 3 00 

5 TTTGCATCAT CAATGCACGT TCTGAACCAG AGCCTTCAAC ACAAAGATAA ATTTGTACAG 63 6 0 

CAATACCGCC TCTAACTCTT CGTTGCGATA TGCCTGCAAA TTTCTTACCA TCGATACTTA 64 2 0 

AGTCAAATTT TCCTGGGCAA TAAGAATGTT CAATTTCCAT CGTATCAATA TCAACATTCT 64 8 0 

10 CATTTTCGAA CATTTTGCTA ATTAAGAGGT ACATCACAGT AAACGCTTCA TCAATCGTTG 6 54 0 

TTTCTGTTTG TCCTTTGAAC ATCAGCGATA T ATTTAAT A C ACCTTGATCT AGAACGACAC 6 600 

CTAAGCCACC AGAATTTCTA ACAATGGCAT TATAACCAAT CTCATTCGTT AAATAATCAA 6660

TGCCATCTTT TAAAAACGGC AATCTTGAAT CATGAATACC AAGAATAACA GTATGTTGAT 6720 

GAATCCAAGT ACGCACAACA TTATCTGATA TATCTTTGCC CACACTTTCG CAAAATGTAT 6 78 0 

CATCGAATGC GAAAGATTGC ATAGGTTCTA ATCCAGAAGA ATGATCGATA TATCGCCAGT 6 84 0 

TGACGCCATT AAAATATTTA CTCGCTAAAT CCATCGTTAT TGTAAGGCTT GCGCTGCTGT 6 90 0 

AATAATTGAA AGATTGTATA CATCTTCAAT TGAGCAGCCA CGTGATAAGT CATTTACTGG 6 96 0 

, ?5 AGAATTTAAA CCTTGTAATA CTGGACCAAC TGCATCATAT CCACCTAAAC GTTGTGCAAT 702 0 

TTTGTAACCA ATATTACCAG CTTCTAAACT TGGGAATACA AAGACATTTG CATCACCTTG 70 8 0 

TAATTTAGCA CCTGGCGCTT TTTTCTCAGC AACACCTGGT ACAATCGCAG CATCAAATTG 714 0 

30 GAATTCGCCA TCAATGATTG CTTCTAATTT TtCTTCTTCA GCTTTTtGTT GTGCTAATTT 72 0 0 

GACAGCTTCT TGAACTTTTG TCACGTCGTC TGATTTAGCA GACCCTTTTG TTGAAAAGCT 72 60 

TAACATTGCA ACTTTTGGAT CCATGCCAAA GCTTAATGCT GATTTTGCAC TTTCTACTGC 7 3 20 

AATTTCTGCA AGTCCTTGTG AATCAAGTTC TGGATTGATT GCACAATCAC CAAAGATGTA 7380

TTGTTCATCA CCTTTAATCA TAAAGAAGAT ACCTGATGTT CTTGATACAC CTGGTTTCGT 7 44 0 

TTTGATGATT TGTAAAGCTG GACGCACAGT GTCGCCTGTT GAATGTGCTG CACCACTAAC 7 50 0 

TAAACCATCT GCTTTACCAG CATAAACAAG CATTGTACCG AAGTAGTTCA CATTGTTTAA 7 56 0 

TAATTCTTGT GCTTGTTCTT CAGTCGCTTT ACCTTTACGT CGTTCAACAA ATGATTGAAC 7 62 0 

TAATTCAGCT TTCAATTCAC TTGTCG CAGG ATTAATTAAT TCAATATTAG AAATATCAAG 7 680 

ATCAAGTTTT TGCGCTAAAG ATTGAACCTT AGTCTCATCA CCTAACACGA TTGGTGTAAC 7 74 0 

ATAATCTGTT GCTTGTAATT GTGTTGCAGC TGTTAGAACA CGTTCGTCCT CTCCTTCAGG 7 800 

TAATACGATT TTAACGTTTT TACCAGAAAG TTTGTCTTTT AATACATTTA ATAAATCAGC 7 86 0 

CATAATGTCC TCCTGTAATA TAAATCTTAT TAATCATTCA CGGTATAATT ATACGCCATT 7 92 0 
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TATGATAAAA TTTATAAAGA ACTGATGATT TTTGAAAAGG AGCGATAAAC ATGAGTCAAG 8040

CAGCCGAAAC ATTAGATGGT TGGTATAGTC TACATTTATT TTATGCAGTT GATTGGGCAT 8100

CATTACGTAT AGTTCCAAAG GACGAACGCG ATGCACTTGT CACTGAATTT CAATCATTTT 8160

TAGAAAATAC AGCAACTGTA AGATCATCAA AATCTGGTGA TCAAGCTATT TATAATATAA 8220

CTGGTCAAAA AGCAGATTTG TTATTATGGT TCTTACGTCC TGAAATGAAG TCTTTAAATC 8280

ATATTGAAAA TGAATTTAAC AAATTGCGCA TTGCTGACTT CCTAATCCCT ACATATTCAT 8340

ATGTATCAGT CATTGAATTG AGCAATTATT TAGCTGGTAA ATCTGATGAA GATCCTTATG 8400

AGAACCCTCA TATCAAAGCA AGATTATACC CAGAATTACC ACATTCTGAT TATATTTGTT 8460

TCTATCCAAT GAACAAACGT CGTAATGAAA CTTATAACTG GTACATGTTA ACTATGGAAG 8520

AACGCCAAAA ATTAATGTAT GACCATGGTA TGATTGGTAG AAAATATGCT GGCAAAATCA 8580

AACAATTTAT TACTGGTTCT GTAGGGTTTG ATGATTTCGA ATGGGGCGTA ACATTGTTCT 8640

CAGATGACGT ATTACAATTC AAAAAAATTG TATACGAAAT GCGCTTTGAT GAAACAACAG 8700

CACGATACGG TGAATTCGGT AGTTTCTTTG TAGGACATAT TATTAACACA AACGAATTCG 8760

ATCAATTCTT TGCGATTTCT TAATACATTG GTACGTTTAT AAATTAATAA AAAAATTCCA 8820

AGCTTATCGG TTTAAGCTTG GAATTTTTCG TTTATCTTCA GTATATTCCC GTATACATAA 8880

GACGTGATTT GGTAAATAGT TGAAATCTGT ATGTTTAAAC TTATATATAT GTGCTAATGT 8940

ATTATCAATA ACAAAGtACA CTTTGCTCAT AGCAAgTsaC CCGAgTAGTC TTCCTTGGGA 9000

GAACTTTAAC TACTATCACT ACATATAAAC GTTAACCTCA ATAGAAATTA TACAGTCGCT 9060

ACTCTATACA ATTTTTGTAA TGGTTAACTA ATATTATTTT AACCTATTTG AAATATTTGA 9120

AACATATTTT TGTCGAATTT TTTTCAATAA TTTTTCCTTT TTATACTTCA AGAGAATTTT 9180

AACTACTAAA AATTCCGATG ATTATTATTA CAATAGTATC AAATATTAGT TTTTTAAAAT 9240

CAATAACAAC TTATCAAAAA GCTCATGTGG TTATTTTATA GTGTATAAAC TATAATGAGT 9300

ATTAAATTCT TATAAACAAT GGTGATGAAA TGGACATAAA TTCAGAAGAA TACAAACAAG 9360

AGGTACTTAT CAAAGACGTT GTCATGCTTG CTGCTCGCAT ACTATTAGAA TCTGGTGCAG 9420

AAGGTACGCG TGTAGAAGAT ACCATGACAC GTATTGCAAA AAAACTTGGT TACAGTGAAA 9480

GTAACAGCTT TGTTACAAAC ACTGTCATCC AGTTTACGTT ACATTCGGAA TCGTTTCCTA 9540

GAATATTTAG AATTACCTCT CGAGATACAA ACTTAATAAA AATTTCTCAA GCTAATAAAA 9600

TTTCGCGTCA AATTACAAAC AATGAAATTT CTTTAGCCGA AGCAAAAACG CAACTTGAAA 9660

AAATATATGT TGCTAAGCGT GACAGCAGTC TTCCCTTTAA AGGTTTTGCT GCAGCAATGA 9720
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TAGCAGGTAG TCTAGGATAC CTAGTCACTG AGATTTTAGA TCGTAAGTwA CACGCACAGT 984 0 

TTATCCCAGA ATTCaTTGGT TCaTTAGTTA t TGGGATTAT CGCCGTTATT GG A CAT A CAC 9900 

5 TTATTCCAAC AGGTGACTTG GCAACTATTA TCATTGCGGC AGTCATGCCT ATTGTTCCTG 996 0 

GTGTATTAAT AACAAACGCA ATACAAGATT TATTTGGTGG ACACATGTTG ATGTTCACAA 10 02 0 

CGAAATCATT AGAAGCATTG GTTTnGCGTT TGGCATCGGT GCTGGCGTTG GTAGCGTATT 10080 

AATTTTAGTA TAGGAGTATC AGA CTATGTT TTGGATCTTA AACTTTATCT TTAGCTTTTT 1014 0 
AGCTTC 10146

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:

25 ATTTAATTGG TTGGTGGCGT ATTCTCnTTT CCAAGATTTG CCACGATTGG ATGTTCAGTC 60 

GACAAATATG TAACCAGTCC CGATAAATTA CGTTCCTCAA TCATTGATAG TATGTCTGGT 120 

GTTTGTAACG ATTTATCGAG TAAGGCATCA AGCAATTGAT AATGTCCCAA TACAACAAAA 180 

30 TGCACGTTGT CTCTCAACTG CTGTTGAATA AACTGAATAA AGAG CTTTAA GCTCTGTTGC 24 0 

ACATTCGCTA ATGATGGTGC ATAGTTTTCC AAACCAACTT GTACAGCCGC TTCATTATTC 3 00 

C G AA TG ATT A AACCTGTGTA TGCCACTTTT GTTGCTGCAG TTGGATACAT TGAGTAATAA 36 0 

35 CGCAATAATT GATCTGTAAA ATCATTTCGA AGTGCATAAA TTTGATGCTC ATGTTGCCAA 42 0 

AAATfTCGCT CACCCATCTG CTgCAAATCC TCATGGTTCA ATTGTTTCCA GTCCAACTTT 480 

TCAACCACAC TAAAATCAAC TAACTCATAA TCCGCTTTAT TAAAATATTT TAAAAATGCT 54 0 

GTTTCCGATT CTTTTAACGC AATTAATTGT TCTGnATTAT TCACTCGACC ACCCTTTACT 6 00 

TTCAATACTG TATTTAAAAT CACTTGGTAT TTTCGTTGTT TGCTTTACTT CTCTACCACG 660 

CTAAAGTGTA ATATGATTAA TAACTTATCA TTTTTAGCAA TACATTACAA CCTTTTTCAG 720 

AAAATTCGGT GTATTGATTT TAAAATTTTT TAAAATAnAA AAGGCAAGAC ATTTGTGCCT 7 80 

ATAAAAATGC TTAACCAAGA TTTTTATATT GaAGTTGTAC TTCTTGCACA TATTGTCCTT 840 

SQ GCCTTATTAT GTAAAGTTAT TTTCTTTCTA TCTTTTTATT AAATTTAACT ATTCTTCATA 900 

ATCCCGATTC CCTTTAAAGT AACGTCTATC TTGTTTACTA TATACATTTT CAGGATTAAA 960 
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TTGGTAAACG TTCGTTGCTG AT AT AT CTGT AAAATTGTTT GGACCGACAC CTGCAATAAA 108 0 

CTTAAACTCT GCTTCATCTA CCAAATAATC ATACGCTTGT GTATGTCTAT CCTGTGCGCC 1140 

ATGTGGAAAT ACAAACATAT CTGTTTTACC T A CAATTGGT TCAACTTCAT CTTTCCATCT 120 0 

TTTAGTATCA CGTTTAATAC CTTCTAAAGA TGTTTTTTCA AAATTAATGT GACCATATGA 126 0 

ATGACTCGCA AATGACCATC CATCCCGTTT CATTGC GCGA ACAACTTCCT CAGCTGCCTT 132 0 

TTTATTCTTT GTATAATCTT TACTCGTTAA TTCATTCGTG CGATAACCTA ATACGCCCTC 13 80 

ATAACCGGTT AAAGCAACAA CACCTTTTTC ACCATTTAAA GAAAAATCTG GATGCTCTTT 144 0 

/5 TACAAATTTA TTTAAAATTG GCACGATATC ATTGTCATCA GAATAAGTAG CATGGCCTTT 150 0 

TTTGTCTGTA GTTTCAGAAA CAACATGTTT ATTTTTATCG AGTACTAAAC GGTCAG CAT A 156 0 

ACCATGGTGT CTCATGTAAC TATAGTAATT CAT AT CAT CA ATTGAGATGA TTAGTGGCTT 1620 

TTTACCTTTC GGCAATTTTA TTTTTTTGGC TTTTACATGA TGAGATGATA AGTCGTATAC 1680

ATCATGTGGA TTAACGATGA TGTAATTATT TTTATATAAT TCGTTCAATG ATTTTTTAAA 174 0 

TTCACTTACA GTAATCATCC AATCATTGTT GCCCTTAGCT TGGTGTGTAT CTCCTGTAAA 18 00 

2S CGCAACTTTT GGGTCTGTAA TTAATGGGTG ATAAAACACA TGATAAACTT GGCCGTGATA 1860 

TGTTTCCCAA TGTTCATCCA TTTTcGATTT aTGCTTTGCA TACTCATTTG GATTAACAGA 192 0 

TTTATTkTGA GCTTTCTCAT TTTGCTTGGA ACAGCTATAT mACAATGCAA CTGATAATAA 1980 

CAGAAAAAAT AGCAATAAAT ATTTTTTATG CATTAAACAT TC 2022

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1340 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:

ATAGAGTGAT AAAGGATGGT TGTCATATGA TAAATGCAGT AGTAATAGCA GTAATTTTAA 60 

45 TGATTGTGCT ATGTTTATGT CGATTAAACG TnTTATTAAG CTTATTTATC AGTGCGCTAG 12 0 

TTGGTGGCTT AATTTCAGGC ATGAGCATTG AAAAAGTTAT AAATGTATTT GGGAAAAATA 180 

TAGTCGATGG TGCTGAGGTA GCATTAAGCT ATG CTTTATT AGGTGGATTT GCAG CATTAA 24 0 

50 TTTCATACAG TGGTATCACA GACTATTTAG TAGGAAAAAT TATAAATGCA ATTCACGCTG 300 

AAAATAGTCG ATGGTCAAGA GTTAAAGTCA AAGTGACAAT AATCATTGCA TTATTAGCTA 360 
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CACCATTGTT AAGTCTGTTT AATGACTTAA AAATAGATAG A CGTTTAATC GGTTTGATTA 4 80 

TCGGTTTTGG TTTATGTTTC CCGTATGTGT TATTACCATA TGGATTCGGT CAAATTTTCC 54 0 

5 AGCAAATTAT TCAAAGTGGC TTTGCAAAGG CAAATCACCC AATTGAGTTT AATATGATTT 600 

GGAAAGCAAT GCTTATTCCT TCAATGGGGT ATATTGTTGG CTTACTTATC GGTTTATATG 66 0 

TATATCGTAA ACCACGTGAA TATGAAACAC GTAAAATTTC AGATAGTGAC AATGTTACAG 72 0 

AGTTAAAACC ATATATCTTA ATAGTAACAA TTGTAGCAAT ACTAGCTACA TTTTTAGTAC 78 0 

AAACATTTAC AGATTCAATG ATTTTTGGTG CACTGGCAGG GGTACTCGTA TTCTTTATTT 84 0 

CACGTGCATA TAATTGGTAT GAATTAGATG CTAAGTTTGT TGAAGGTATT AAAATTATGG 900 

CTTATATTGG TGTAGTTATT TTAACAGCAA ATGGATTTGC TGGTGTAATG AATGCTACTG 960 

GTGATATAGA TGAATTAGTT AAAACTTTAA CAAGTATTAC TGGTGATAAT AAATTATTTA 1020 

GCATTATCAT GATGTATGTG ATAGGTTTAA TTGTCACTTT AGGTATTGGA TCATCATTTG 10 80 

CAACAATTCC TATTATCGCA TCATTATTCA TTCCTTTTGG AGCGTCAATT GG AC TAGAT A 114 0 

CAATGGCATT AATCGCATTG ATTGGAACAG CGAGTGCATT AGGTGACTCA GGTTCGCCTG 1200 

2 5 CAAGTGATTC AACATTAGGA CCAACTGCGG GATTAAATGT TGATGGCCAm CATGATCATA 126 0 

TACGTGATAC ATGTGTACCA AACTTCTTGT TTTATAATAT TCCTTTAAAT GATTTTCGGT 132 0 

ACTATTGCTG CTATGGTACT 1340

(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:

CAAAATCTAA GAGAATAAnA TTTGTTAATT TnAAATAGCA AGCAATTCAA AGTTATATGT 6 0 

GTAATAGATA AAATAGATAT CCCTATAGTG ATGCGTTACT AGCTAAACAT AATAACACAT 12 0 

TAGAAGATAA TGAAGTTAAG GAGTTACTGG ATTGTTTCGA CTATGTAATT AAGTATAAAA 180

ATATCCAACG ACAAAACGTA ATTATAAAAT GGTAAAAGCT ATGGTACAGT TTCAAATTGC 24 0 

TAATGACATG CGTATCGGTG AGCTACTTGC AATAAAGAGA GTAAATATAA ACTATGAAGA 3 00 

50 TAAAACGCTA GATATCGACG GTAAAGTTAA TTGGATAACT GAAAAAAGAC GGGAGCATTC 3 60 

GGAGTAAAGG AGACAACTGA AAGAAGTAAT AGCTATAAGG CCACAGGGCT CACTACCCAA 42 0 




EP0 786 519 A2 




TATTCACAAA TACGTCTGGT AGCCCTATCG ACTCGAACAA AATTAGCCAC ATTATTAAAG 54 0 

GGGGGCGCTG ATATTAGTTC TATTAAGAAA CCTATAACGA CGCATACATT ACATCATTCG 600 

CATATATCTA CACTTGCTCA ATTAGGAATT AACTTAAAAG CAATGCAAGA GCATGTAGGT 660 

CATTCAGATT ATAAAArAAA TCTAGAGATA TACACACATG TTACTAATCA GATGGCGAAA 72 0 

GATATGATGA ATAAATTTGA ACGATTGGGG AGTTAAAATT GGAAAAAGAT GaTACACTAG 780 

CAGAAATTAA GCCTATGCTC AATTTTGATG AGCAAATAGC AAAATTAAAA CAGATGAATA 84 0 

TATTTTTTAA TATTATTGAC ACCGAAAAAG CAAATGAAAT TCTTAGAAAA AATAATTACT 900 

TCTTCAAACT wGcTTATTTC CGaAAAAATT TCGrAAAAAA GaATGGCGGC TATTTCATAG 960 

AATTTGCTTA TTTAT CAGAT TTAGCAACTA TAGATATGaA ATTAAGATAC ACAATGTTGC 102 0 

ATTTAACTTT AGATATTGAA CATAGTTTAA AGTATCTAGT CTTAAAACTA ATAACAGAAA 1080 

ATAACCAAGA AGATGGTTAT AAAATAATAG ATGAGTTCTT ATGTATTGaT AAATCATATA 114 0 

GCAATTCAAA TTTTGACACA AATTCAAGAA CACCAGAAGA AGTTATGGAA ACCAAAATCA 1200 

AAAATAAAAA CGAAATATTC AAGCATATGA ATAAACGAGG ACAACTACCC GAGAAGTTGa 126 0 

ATAAATACTA TCmAAATCCA CCCGCnnGGk TTTGCaTTGr ATTCATGCAA CTAGGTCAAT 1320 

TCGTTTCGTT TCTCAACTTC TATTACAAGA AGTACAATGA CGAAGAATTG AGAGTTGCTA 1380 

ATATTTTAAT GCCTTTAGTT AAAAATATAA GAAaCAAATC AGCTCATAAC CAACCCATCA 144 0 

TAG CAAATCT AAATTATGAC AGTAGATTAC CTCAATATTT ATTTGAAAAA GGGAATAATA 150 0 

TAGGCATATC TAGAAACATG TTCGGAATAA AAAATTTCAT AGATACTlcTC ksTACGCTAG 156 0 

AATTACATAA TCAAGTTTGT AGTAATGCAA TTATCCAAGC AAGATATCAC GATTTGGACC 1620 

AACTTCAAAA G C GAT AT AAA AGrAACGrAA GCTATTATAA TAATGCATTA GCTATCAAAA 16 8 0 

GATTTTTTAT AGCTTTAGAT AAAATTATTG ACTTCAACAG A C CAAAAGT A TAAACTATCT 174 0 

AGTGAGGAAA GAGACTTATA GGTCTCGCGA GTTATTTTAA TTCGTATGCA AGAAAAAGAA 1800 

GAGCTATGCA TTTTATTTAA AATGCGTAGT TCTTtTTTTA TGCATCTAAA TTCATATTAT 1860

TTTTGCAATA TAAACATATC TTTGTGCAAA TTCCGAACAC AAAACATTCA CAT CATC CTT 1920 

TTTtGCCCTT TTTCTATACC CCAAAACACA AAAAGCCCCG TAAGCCTATG CcTACGGGgT 1980 

TTGACAATAA ATTATATATT ATTGTTCTTC TTTAACATAT GGTAATAATG CCATATGACG 2040 

AGAACGTTTG ATAGCTGTAG TCAaCATACG TTGATATTTA GCTGAAGTAC CAGTT AC A CG 2100 

ACGTGGTAAA ATTTTACCGC GTTCTGAGAT AAAACGTTTT AATAATTCAG TGTCTTTGTA 2160 

GTCGATATGT GTAATACCAT TTGCTGTGAA ATAGCATACT TTTTTACGAC GACGTCCGCC 2220 
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CGTTAATTTT TATTAGAATG GTAAGTCATC ATCACTTATA TCAATCGGTC CGTTTGCATT 2 34 0 

TGCAAATGGA TTATCAGATT GTTTCGTGTT TGATGAATTA TTGTACGAAT TGTTTTGTCC 24 0 0 

5 TGATTGTTGA CCACCGAATC CTTGACCGTA ATCTTGGAAT TCATTTTGTT GACGTTGGCC 24 6 0 

ACCATTTTGT TGCGCATTTT TAGGTTCAAG GAATTGAACG CTATCACACA CAACTTCAGT 2 52 0 

AACAAACACA CGACGACCTT CTTGATTTTC ATAATTACGG GATTGTAAGC GACCATCTAC 258 0 

ACCAGCTAAA CTACCTTTAG ATAAATAGTT ATTTACATTA TCTGCTTGTC TTCTAAAAAC 2 64 0 

AACACAGTTA ATAAAATctG CtTCGCGCTC CCCTTGAGCA TTCGTGAACG TACGATTTAC 2 70 0 

TGCAAGAGTG AATGtCGCTA CACTCACACC TGAGGGAGTG GTTCTGTATT CCGGATCTTT 276 0 

CGTTAAACGA CCTACTAATA CAACTCTATT TAG CATTT AA ACGCCCCCTC TAATTATTAC 2 82 0 

TTGTCTTCGT CTTCACGAAT AACCATGTAA CGAATGATAT CGTCACTGAT TTTAGCTAGA 2880

CGTTGGAATT CGTCAGTAGC TTTGTTGTTA TCAGATTTAA CACGTACGAT GTTGTAGAAG 2 94 0 

CCATCTTTGA AAT CATTG AT TTCATAAGCT AGGCGACGTT TACCCCAGTC TTTTGCTTCT 3 00 0 

AAAACTTCTG CACCTTCAGT AGCTAAGATA CCGTTGAAAC GTTCAACTAA CGCTTTTTTA 3 06 0 

25 GCATCTTCCT CAATGTTTGG GCGTACGATG TACATAACTT CATATGTTCT CATTTTATAT 312 0 

TTGCACCTCC TTGTGGTCTA TACGGCTTAT CAATCTTAAA ACAGATAAGC AAGGaATAAT 3180 

TTTCATTACT CACAATAAAG AATTaTATCA TGCGCCATTA CTTTTTACAA TaATAATTcA 3 24 0 

30 AACTACTCTT CATATCATTT TTGATAT t AA TTCATTTGaA ACTTTCnATG ATATTTTnAA 3 3 00 

AAATACACTT CACAAAAGCG AACATATGTn CTATAAnAGT TGTGAGGTGG TAAGGAATGA 3 3 60 

ATTTA 3365 

(2) INFORMATION FOR SEQ ID NO: 247:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

GTTAAAAGTA ATTGGTGGTA TTGATGATGA TTTTACAGCC AATGTTATGC ATCCAAATCA 6 0 

ATATCGAATT CGATATTCGT CTCAGAAACA GGACCTTAAT GAAGATATGA CAGTTTTTGA 120 

TGCAGTATTA AGTTCTGATA CAACAACTTT ACGCATCATC AAGCAATATG AGCAGGCAGT 180 

ACAAGCTTAT GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATGG ATGCGCAAGA 24 0 
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ACTAGGTATA CATGATACTA CTAAATACAT TAAAGAATTA TCCGGCGGAC AACAAAAACG 36 0 

TGTTGTACTT GCTAAAACAT TAATAGAACA ACCAGATTTA TTGTTATTAG ATGAACCTAC 42 0 

5 GAACCATTTA GACTTCGAAT CAATCAGCTG GTTGATCAAT TATGTGAAGC AATATCCTCA 48 0 

TACTGTTTTA TTCGTAACCC ATGATCGATA TTTTTTAAAT GAAGTTTCCA CTAGAATTAT 54 0 

TGAACTAAAC AGAGGTAAGT TAGCGTCATA TCCTGGTAAC TATGAATCTT ATATTGAAAT 6 00 

GCGCGCTGAA AGAGAAGTAA CACTTCAAAA GCAACAACAA AAGCAACGAG CTTTATATAA 66 0 

GGAAGAACTT GCTTGGATGA GGGCTGGgAG CTAaGGCTCG TACTACAAAG CAACAAGCTA 720 

GAATTAATCG ATTTAATGAC CTAGAmAATG AAGTTaACCA GCAATATAAA GACGATAAAG 78 0 

GTGAATTGAA TCTTGCTTAT TCaAGATTAG GTAAGCAAGT GTTCGAATTA GAAGACTTAT 84 0 

CAAAGGCTAT TAATGATAAA GTATTATTTG AACATCTGAC GGAAATTATT CAAAAmGGTG 90 0 

OQ AGCGTATTGG TGTTGTTGGG CCAAATGGAG CTGGTAAAAC AACACTCTTA AATATTTTGA 96 0 

GTGGAGAAGA CCAACAATTC GAAGGTAAAT TGAAG ACT GG GCAGACGGTT AAAGTAG CTT 102 0 

ATTTTAAGCA AA 1032
(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

TGTGATTAAC GAAGCTTATT TTCGTACACC TTCAACAACT GATTACAACG GCGTTTATCA 6 0 

AGGTPATTAT ATTGATTTTG AAG CAAAGG A AACTAAAAAC AAGACGTCCT TTCCTTTAAA 12 0 

TAATATTCAT GACCATCAAG TCGAACATAT GAAAAATGCA TATCAACAAA AAGGTATTGT 18 0 

GTTTTTAATG ATTCGTTTTA AAACGCTAGA TGAAGTTTAT CTTTTACCCT ATTCAAAATT 24 0 

CGAAGTATTT TGGAAGAGAT ATAAAGATAA TATTAAAAAG TCTATAACAG TTGATGAAAT 300 

ACGAAAAAAT GGTTACCATA TTCCTTATCA GTATCAACCA AGATTAGACT ATCTAAAAGC 360 

AGTTGATAAG TTGATATTAG ATGAAAGTGA GGACCGCGTA TGACGGAAAA CAAAGG A TCT 420 

TCTCAGCCTA AGAAAAACGG TAATAATGGT GGGAAATCCA ACTCAAAAAA GAATAGAAAT 4 80 

GTGAAGAGAA CGATTATTAA GATTATTGGC TTCATGATTA TTG CATTTTT CGTTGTTCTT 54 0 

TTACTAGGTA TCTTATTGTT TG CTT ATT AT GCTTGGAAAG CACCTGCTTT TACCGAAGCT 6 00 
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TTAGATAATG GCCAAAGACA TGAGCATGTA AATTTAAAAG ACGTGCCGAA AT CAATG AAA 7 20 

GACGCAGTAC TTGCAACTGA AGACAATCGT TTCTACGAAC ATGGCGCACT TGATTATAAA 780 

CGTTTATTCG GTGCAATTGG TAAGAACTTG ACTGGTGGAT TTGGkTCtGA AGGtGCCTCA 84 0 

ACATTAACAC AA 852
(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

20 CACTTTTTTC ATTAAAAATC TCATATTTAT ACACTGAACG TAATCTCGAA TATTTTTCAA 6 0 

CCCAAGTTTT AACTTTAACT TTTTCTGGAT AAAAAATAGA CTTTTTATAA TTGACATTGA 12 0 

GGTCAGTCAC AGGTGAAATG ATTCCTTGTT TTTCCATATC AGCATAACTA AAACCTAACT 180 

2S TCGATATATA ATCCAACCGC GCAACTTCAA ACCAAGTTGC ATAATTCCCG TGATAAATTA 24 0 

CACCCATCTT ATCAGTTTCA GCATAACGCG CTTCTATTTC TGTAATACTA TATATCATTT 300 

TAAGCCTTCT TTCAGTTTAA CTTTATATCT CATTCTAACA TAAAATACAA GAAGAGGCCG 36 0 

30 GCCAAGAACA CAAAGGkTTT GAACCGACCT ATTATATCAT AAaGTTTATA GAAGTATTTT 42 0 

TGAGCACTAT CAAAGTGCCT CAAAT AC CG A TTAAAATTTT ACTGTGATAT CTATTTTTTA 4 80 

TTGCGCTAAT TTATTTCTTA AAACCATTTG TAAAATTCCA CCGTGACGAT AGT AAT C CAT 54 0 

TTCAACAAGT GAGTCAAAAC G AA C CAT AG C GTCAAATTCT ACCAAATCAC CATCTTGCTT 600 

CTTAp CAGT A ACTTTGACGT AGt CATGTGG TTGAACATTT TCATCAATAT T AACAG r AAT 660 

TTCTTCTGTA CCATCTAGAC CAAGAGAATC AGCTGATTCA CCTTTTTTAA ACTCTAATGG 720 

TAATACACCC AT CATAACTA AATTTGAACG ATGGATACGT TCATAACTTT GTGCAATAAC 780 

TGTTTTAACA CCTAATAAGT TTGTACCTTT TGCTGCCCAG TCACGAGATG AACCCATACC 84 0 

45 ATAATCGTTA CCAGCTAATA CAACTAAACC TGTACCATCT TCTTTATATT TCATTGCAGC 900 

ATCAAAGATA GGCATTACTT CATTTGTTGG CCAATAAGTT GTAAAACCAC CTTCAGTACC 96 0 

TGGCGCTAAT TGGTTTTTAA TACGTATATT AGCAAACGTA CCTCGAACCA TTACTTCGTG 102 0 

50 ATTACCACGT CTTGAACCAT ATGAATTAAA TTCACGAATA GGCACTTGAT GATCTTGTAA 1080 

ATATTTACCA GCTGGCGTAT CTTTACCAAT TGCACCTGCT GGAGAGATGT GGTCAGTTGT 114 0 
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15 



TTCTTTAGAT AATCCTTGGA AGAATGATGG ATTTTGAATG TATGTTGAAT TAGGATCAAA 126 0 

GTCATATAGA GGTTGATCAG TTACAT CAAT CTCATTCCAT AATTCGTTGT TATTGTATAC 132 0 

G TT ATT A TAT TCTTCAATAA ATAATTCAGG TGTTACAACA CTATCAACGG TATCTGAAAC 13 8 0 

TTCTTTAATT GATGGCCAAA TATCTTTCAA ATATACATCT TCACCGTCAT TACCTTTACC 144 0 

AATAGGTTCA TTTTGTAAAT CAATATCAAC CGTTCCAGCT AATGCATAAG CAACAACTAA 1500 

CTGTGGTGAA GCTAGGTAAT TGG CTTTAAC AAGAGGATGG ATACGACCTT CAAAGTTACG 156 0 

GTTACCAGAT AATACAGATG TCACTAATAG GTCCTCATCA GCAATCGCTT TTTCAATTTC 162 0 

TGGTAATAAA GGACCTGAAT TACCGATACA AGTTGTACAT CCATAACCAA CCAAGTTGAA 1680

GCCTAAATCA TCTAAATAAG GTTGTAAGCC AGCATCTCTT AAATATCCGG TAACAACTTT 174 0 

TGATCCTGGT GCTAGAGAAG TTTTAACGTA TTCAGGAACT TTCAAGCCTT TTTCAACTGC 1800 

TTTTTTAGCA ACTAAACCTG cACCTAACAT TACATAAGGG TTAGATGTAT TTGTACATGA 186 0 

TGTAATTGCT GCTATTGCAA TATCACCTGT TTTCATTGTA GCTTTTGATC CATCTTTAAA 1920 

GTTAATTTCA GCTTTCTTAT CAAATTCACT TTTATCTAAA CCGTGTCCTT GGTTGCCTGC 1980 

TGGAGCTGTT ACAGAATTTT CAAATGATGA TTTCATATCA CTTAAGAAAA TTAAATCTTG 204 0 

AGGACGTTTT GGTCCTGAAA GCGATGCTTC AACTGTTGAT AAATCCAATT CGATAACATC 2100 

TGTATAATTA GGATCTTCTT TCTCAACATC AAAGAACATA TGGTTTTGTT TCAAATATTC 216 0 

TTTTACTAGC GCGATATGTT CGTCTGATCT ACCAGTTAAC TTCATATATT TAAGAGATTC 222 0 

ATCATCAACT GGGAAGAATC CGCAAGTTGC TCCATACTCT GGTGCCATGT TTGCAATTGT 22 8 0 

AGCACGGTCT GCTAGTGGTA AATGTTGTAC ACCTGGACCA AAGAACTCCA CAAATTTACC 234 0 

AACAACACCT TTTTTACGTA GCTCTTGAGT TACTCTTAAC GCTAAATCAG TTGCTGTTGC 24 00 

GCCTTGTGGT AATGAATTTA CTAGTCGTAC AC CAAT AA C C TCTGGAATTG GGAAATAAGA 24 6 0 

AGGTTGTCCA AGCATTCCAG CTTCAGCTTC AATACCACCA ACACCCCATC CTAGTACGCC 2 520 

AATACCATTT ATCATTGTTG TATGTGAATC AGTACCAACT AATGTATCTG GAAATGCAGT 2 5 80 

TTTTTCACCA TCTACATCAC GAACATGTAC AACACTTGCT AAATATTCTA AGTTAACTTG 264 0 

GTGAACTATT CCAGTTGCAG GAGGAACTGC ATTGTAATTA TCAAATGCTT TCGTTGCCCA 2700 

ATTTAAAAAC TGATAACGTT CATAGTTACG TTCAAATTCT AATTTCATAT TACGTTCAAG 2760 

AGCTTCTGGA TTTGCATAGC TATCCACTTG AACTGAGTGG TCAATAACTA AATCCACCGG 2820 

TACTTCTGGA TTAATTTTAG TAATATCTCC CCCAACGTCA TCCATTGCTT TACGTAAAGA 28 80 

AGCTAAATCA ACTACGGCTG GTACACCTGT GAAATCTTGT AAAATAACAC GAGAAGGTTT 294 0 
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40 



GTCTGTAATT ACAAAATCAT CTTCTTGACG AAGTAAAGAT TCTAACAAAA CACGAATTGA 3 06 0 

ATAAGG7AAA TTGGAAACTT TAGTAATACC TTGCTCTTCT ACAGCTTTTA AATCATAGTA 3120 

AG T AT AACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTGCAGC 3180 

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA ATTGTATTAT 3 24 0 

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3 3 00 

TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTGTAATAAG TATTTTTTAT 3360 

TAATTGAGAA TAATTATCAA TTTCGCGAAT GATTCAATTC AATTTTTAAA CGTATTATTT 34 2 0 

CATTGAGCAG AAAGAAAATT ATGGCACCAA ACTTTAATAT TTTTTTCAAT GTCATTCTTT 34 8 0 

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 3 54 0 

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3600 

GCACATTATT GTAAG CTGAC TTTTCGTCAG CTTCTGTGTT GGGGCCCTCA CCCCAACTCG 3 66 0 

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTGAC TAGAATTGAA 3 720 

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTT CG TAG AG 3 780 

CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA TATTTAATTT 3 84 0 

GCGTTTATAC CGTCAAACTT CACTTTAGCT TTGTCAAACC CCTTTCTATT AAGTTTTCAG 3 9 00 

AAATAAACCT ATCTTAAAAT ATAAAAAAAT CGAGAATTCG TAGTTTAATA ACGAAATTCT 3 96 0 

CGTTCTTATC CTTTTGAATA TACTCAATTT TCCACAAAAA CAAACAAGTA G TAT AT CTGT 4 02 0 

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4 0 80 

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AGAATGTCAG ATTCATAATC 4140 

TTCTTGTTGA CT CTT GAT AT ATTCTTGTAA GCGATGTTTA TTCGGAGTCA ATGTTAAACC 42 0 0 

TAGGAATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT AC CATCATG A GTATGACGAC 4 2 60 

AAAGGAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4 3 20 

GCCGTTACCG CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGCCTGTG AAATTCCCCA 4 3 80 

AACAACTTTT AC CAT ACT AG ATGGATTTAA TGAACCAAAT GTTGTTTGCA TTCCTAATAC 444 0 

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATGCAATCAA 4 500 

CGATAAAACA ATGCCAAATG GCACATGATT AAACACTCCA AATAGCTGTG TTTCAGGAGT 4560 

CATATCAAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG CCTAATACAC CAAAGACACT 4620 

AAACCAAACA AAACTAACAA TTGCTGGAAC TAGCAAGACA CCAGAAATGA ACTCTCTAAT 4 6 80 

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4 74 0 
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TGCTGTATCA AAACTATTAA ACAAGAATGT GTTTAGTAAA CTACCCGTAG AGCTAGTTAA 4860 

CATATTTAAA ATAAGAACAG TTGGTCCAAC AATTAAAGCA GCTACCATTA AAATAGTACC 4 92 0 

5 TAAACCAATG TTCAAGTTAC TTAAGTATTG AATACCTTTA CTTAATCCAG ACCATGCACT 4 98 0 

TGCTATAAAT AAGATAGTAA CAACAATGAT GATAATCGCT TGTACAAACG TATTGTTTGG 504 0 

AACATTGAAC AAGTAATGTA AACCACCATT AATTTGTAGA GCACCCATAC CTAACGAAAC 5100 

GGCTACCCCA ACGATTGTCG CAAATACAGA TAAAACGTCA ATAAAAATCC CAATAGGACC 516 0 

TTCTACTTTA TCACCTAAAA GAGGACGTAA AGTTCTAGAT AATAAACCTG GTTCACCTTT 522 0 

ACGGAATTGC GAATATGCCA ACGTAACGCA ACAACACCAT AAACAGCCCA AGCATGGAAT 52 8 0 

CCCCAATGGA AAAATGTTGA ACGTAGAGCT TCAGTATAAG CTTCAGTAGT TTTGGGATCT 534 0 

GCTGTAGGTG GCGTAGCAAA GTGCGCCATC GGTTCAGCTG CACCATAAAA CACCAAACCT 54 00 

ATCCCCATAC CAGCACTAAA CAACATAGCA AACCATGAAA TTGTATTAAA CTCAGGTTTG 54 6 0 

TCATTTGGTT TACCTAGTTT AAGTTTTCCA ATAGGACTAA AAATAAGGAA TATACAGAAG 5 520 

AACACGATAA TCGTAGTAAG AATAAGATAA TACCAACCTA ACTTTTCTGT AATC CACATT 5580 

2s TTAATATTAT TGGTAACATA GTTGAATTGT TCAGGTAAAA ATGCACCAAG TAATACGACT 5 640 

AT AG CAACAA CAATTGCACT ATAGATGAAG ACTGGTGAAT ACTTCTTTCC ATTTGGATTC 5700 

TCTGGTGAAG AAGAATTCAT AATTAATTAC TCCCTTCAAT TCTATATTTA ATTTTATGTA 5 760 

GTAGAATAAA AATATTATCT AAACATTTTA TTCAATAACT CACG 5804
(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:

ACCCGCGAAT ATGGTCCATC CTATCGATTT ATTTTTAACT GGTTTGACAA TATTTAATTT 6 0 

TTCATAATCA TTCTTAGTGA TTTTGACATA TGTTTTCGGT ATGAGCCAGT TAATAAATGG 12 0 

AAAGAAGAAG ACAATCCAAT TACTTGCCAA ATCaATCATT AAATATTCAC TATCGTATTT 180 

GATTATTCGA TATTTAGGGT TTTTATTAAT AACTTTAGAT TCGCAAAGCA ATGTCTCCAC 24 0 

50 ATCC CTTTAA TTTTATGTGT AATACATTTT TCGATACTTC AAAAGACATT CAAATACTAT 30 0 

CAAGTTACTG TCATCAAAGG TTTTATTAAC TGATATTtTC ATATTTTTAa TCTGAATTTA 3 60 
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w 




(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:

CCAGGGTGCG GAAAGCTTTA AAATTTGGAC TAATAAAGAT G CTGATATT A ATTCTATGAA 60 

AACAGCAGTT TTACAACAAT TAAAAGGAGA ATAACATATG CTTACTGGcA AACAAAAAAG 12 0 

ATACTTAAGA AGTTTAGCAC ACAATATTGA TCCGATTTTT CAAATTGGAA AAGGCGGTAT 180 

CAACGAAAAT ATGATTAAAC AAATAGATGA TACGTTAGAA AACAGAGAAT TGATTAAAGT 24 0 

ACATGTACTA CAAAATAACT TTGATGATAA AAAAGAATTA GCTGAAACAT TAAGCGAAGC 30 0 

TACTCATAGT GAATTAGTGC AAGTGATTGG ATCTATGATA GTGATTTATA GAGAATCTAA 36 0 

AGATAATAAA GAAATTGAAT TGCCATAATA ATGAAAAAGA TAryACTTTA CGGCGGTCAG 420 

TTTAACCCTA TCCATACTGC ACATATGATA GTAGCTAGCG AAGTATTTCA TGAATTACAG 480 

CCAGATGAAT TTTATTTTTT ACCTAGTTTT ATGTCTCCAT TGAAAAAGCA CCATGATTTT 54 0 

ATAGACGTTC AGCACAGATT AACAATGATA CAGATGATTA TCGACGAGCT TGGTTTTGGA 600 

GATATTTGTG ACGATGAAAT TAAACGTGGT GGTCAAAGTT ATACCTATGA CACGATCAAG 660 

GCATTCAAGG AGCAACA CAA AGACAGTGAG TTGTACTTTG TTATTGGGAC GGATCAGTAT 72 0 

AACCAACTAG AGAAATGGTA T CAAATTG AA TACTTAAAAG AAATGGTTAC TTTTGTAGTT 78 0 

GTAAATCGAG ACAAAAATAG TCAAAATGTT GAAAATGCTA TGATTGCAAT TCAGATACCT 84 0 

AGGGTAGATA TAAGTTCGAC AATGATTCGA CAAAGAGTTA GTGAAGGGAA ATCTATCCAA 900 

GTTCTTGTTC CTAAATCCGT TGAAAACTAT ATT AAGGGGG AAGGATTATA TGAACATTGA 96 0 

AAAA 964 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
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10 




TTGTCATGAA ATAAATGGGA TGAATATCAC GACTAGAAGT AATGTTACGA ACAGGAGCGT 120 

ATAAACTAGA GACGCTAAAT TCGACATAGT ATGTnGCTCA ATTATGGCTG ATGATGAATT 180 

TAAAG TATGT GCGTTGGAAC TGTCGGATTT TTGTTCATAA TGTTTTGCAT ATTGCGCCAT 24 0 

GATGAATAGT GTAAAAATAA ATAAAACAAT AAGAGATATA ATGCCCATAA TCAAAAGTAT 300 

TTGTTTAGAG CCTTTCATTA TTTCACATCC TTTCTAAAAT ATATTTGTAA CTAAATTTAA 36 0 

AATAGTTATT TTTGTAATTC TAAACCTTTT TCATCGCGAA AACAATTAAA TAGGTCGCGG 42 0 

TATTAATTAT TATATTATTA CCGCTTAATA TGAAAAATAC ATGAaAATTA ATTTTCTAAT 4 80 

ATACTTTTGA AAAATTATTA CAAATTAGCC CCTTCAAAAC GCGAAAACAT AAGGATTCTA 54 0 

GTTTCAAAAG GGCTGATAAG CATAAAATGA AATGTAATAT TTCGATGTAT AAAATTTTTA 600 

ATTAGCTAAA AATCATCGCA TTAATTTTTT GAGCTACATC ATCAAAATTC GGACATTTTA 660 

ACGACACATA TAATTTAATT TTAGGTTCAG TACCAGAAGG ACGTAAAGCG ATAAATCCTT 720 

CGTCAAATAA GACACGAATA ACATTTGATT TAGGAGAATT AATCTGCGAC GTTGTATCTT 780 

TATCCAAATG ATAAACCTCG CTAGTTAAAT AATCTTCAAT TGCTTTCACT TTGAGTCCTT 840

GAATCTCTTG CGGTGGATTT GAACGGAATT TGGTCATTAT TGCATTAATT TTCTTTTTCC 900

CTTCAAATCC TTCTAGCGTA TGCGgAATAA TGTATCCTCA TGTCTACCAA CAGTTTGATA 960

AATCTGTTCT AATTCATCTT TCAATGTTTT GCCATATAAT TTTAACTCAG AAGCGTATTT 1020

TATAATGAGT GGCACAATTT GTACGGCATC TTTATCACGT ACAAAAGGCT CTGATAGAAA 1080

ACCGTAACTC TCTTCAAATG CGAAAATCAT ATTTGATGAT CATCCAGTTG TCTTATTTCC 1140

TGAGCAATAA ATTTAAGCCC GTCAGCACCT CTTTGGTATT CAACATTATT ATA 1193 


(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1098 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:



TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG TATATnCCTT TTATTCTCCT 60

AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT CCAAAAACCA AnGAATATAT 120

nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT CATTAATTTT GCATAAAGAT 180

GAACCTATGT ATTTAAAAAA ACGTACATGT GTACCTACTT TGTTAATTAA TGGGGAACAT 240
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20 



AAAAAAATCT TCGAACATTC AGGACATGCA CCGCATATTG AAGAACCAGA AGCATTTATG 360 

AATTATTATT TAAAATTTTT AAAAAGCGTA TCATAATATG TGATATATAA ACCTAGGGCA 420 

TAAAGTCCTT AGGCAATGTG AAAAAGCTGA TTACTATTCA TTATTTGATA GAAATCAGCT 480

TTTTTTGAAA TGTATTTGAT ATATACTGCT CGTTATGCGG CTATCTTCCT TATATTAAGT 540

GCCATTAGTG CAAAACCTCT TAACAATTAG GTAAAAAGAG CATAAAAAAA GGAAGTTTAA 600

TAGAATGTAT CATCTATCAA ACTTCACCAA ATTGCGCTAA ACAAAATTAT AGTTCAATTT 660

CGTTGTTTGC TTCAGTGATT CGTTTATTTA CTCGACTCAA TAATGATTCG ATTTTTTTAC 720

GTTGTTGTGC ATTAACAAGA ATTAATACAG TTCTTTCATC ATGCTCATTA CGTTTTTTAT 780 

CGAAGTAATC TTCTTGAGAT AAAATTTTAA CTGCTTTAAC AACTTGTGGT TGTTTGTAGT 840

TTAAATGATT AATAATATCt TTAAGATAGT ATTCTTTCTC TTTGTTTTCG CTGATGTATG 900

TCAATACAGC GAATTCTTCA AAGCTAATTG AnAATTCCTT TTTAATTAAA CTTTTTAATT 960

TGTCAGCATA AGTGACCATT GATAACAACT CAAAGCAATC ATTGATTTTT GTAATTGCCA 1020

TGTTTAAAAC CTCCCTATTT GATGCATCTT GCTCGATACA TTTGCCCCGA TAATATAnTG 1080

TATCTAATCT TTATGnAT 1098
(2) INFORMATION FOR SEQ ID NO: 254:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2881 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:

CCAGGTAAAA TTGTGCAATC ATTTGACGCA TTAATGGACG CCTTGGACAA TGAAGATTAT 60

GAAGGAGAAA AAGTCATTCC ATTCTTAGAT AAACATTTTA AATATCAAGA TGGCCGATCA 120

AGTGAGCGTT TAGTCAGAAA TTTATTTGGT AGCTAAGTTT ATATAGTAGT CAAAGTGGGA 180

GAGGTATAAT GATGAAATTT TCAGTAATAG TTCCAACATr CAATTCAGAA AAGTATATAA 240

CAGAATTACT TAATAGCCTT GCGAAACAAG ATTTTCCGAA AACTGAATTT GAAGTGGTTG 300

TAGTTGATGA CTGTTCAACA GATCAAACGT TACAAATAGT TGAAAAGTAT CGCAATAAAT 360

TGAACTTGAA AGTAAGTCAA CTCGAAACAA ATTCTGGTGG TCCAGGTAAA CCTAGAAATG 420

kGGCGTTAAA ACAAGCAGAA GGTGAATTTG TATTATTTGT GGACTCCGAT GACTATATAA 480

ACAAAGAGAC TTTAAAGGAT GCAGCAGCAT TTATTGATGA ACATCACTCA GATGTCTTAT 540
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CACCTGAAGT TACTTTGTTA AATTCAAGAA TTATCTATAC TTTAAGCCCG ACTAAAATCT 66 0 

ATAGAACAGC ATTACTAAAA GATAATGACA TTTATTTTCC AGAAGAATTA AAGAGTGCAG 720

AAGATCAATT ATTTACAATG AAAGCATATT TAAATGCAAA TCGAATCAGT GTGTTAAGTG 780

ATAAAGCGTA TTATTATGCT ACAAAGCGTG AAGGTGAACA TATGAGTAGT GCGTATGTTT 840

CACCTGAAGA CTTTTATGAA GTCATGAGAT TGATTGCTGT AGAAATATTA AATGCAGATT 900

TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTTTT 960

CTCGTACGAA TGGCTTCTCA CTTAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 1020

CTCTAGGAGA CTTTATACAA GCAGTTCCAG AACGTGTAGA TGCATTGGTG ATGAGTAAAT 1080

TACGACCATT GTTGCACTAC GCGAGAGCGA AAGATATAGA CAACTATAGA ACTGTGGAAG 1140

AAAGTTACCG TCAAGGTCAA TACTACCGTT TTGATATTGT AGATGGTAAA TTAAACATTC 1200

AATTCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCCA AAAGTGAAAA 1260

TGACAGCATT TAAATTTGAT AATCATAAAA TTGTTACAGA GCTAACGTTA AATGAATTTA 1320

TGATTGGCGA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACGAAAC AAGAAGCACA 1380

CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 1440

AAGATATTAA AGCGTATTTA CCTAAAGAAA AAATTTGGGA TGTTTTCTTA GAAGTCCAAA 1500

TAGGTACGGA AGTATTTGAA GTGCGTGTTG GTAATCAACG TAATAAATAT GCATATACTG 1560

CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAGATTAACA CCGTATTTCA 1620

CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 1680

CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 1740

TTGAAGAAGG TATGGCTAGT GTCGTACTAA AAGACGACAT GGTGATGGGA ATGTTAAGCC 1800

AAACSTCAGA AAACGAAGTG CnAAATCTTA CTTAGCAAAG ATATTAAAAA GCGAGACTTC 1860

AAAAATATTG TTAAGTTAAA CACTGCACAT ATCACTTATC CACTAAATAA ATAATAAATG 1920

CCCTCAAATC ATTGTGAGCC AACATGATTT GAGGGCTTTA TTTTGCTGTT TATGACATGA 1980

TTATGACATT TCCCTGATTT TCATTTTCAT ATACATTAAA TTGTATACAC TGGAAATGAG 2040

GAGGTTATCT ATAATGATAA ATAAAAATGA CATAGTAGCA GATGTAGTAA CTGATTATCC 2100

GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTGGCG GACAAGTAAG 2160

TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTG AACGAATTAT TACAGCGTCT 2220

CAACGACGTT GAACAAACGA ATACACCAGG TTCGTTAAAT CCTAAATTTT TAAATGTTTC 2280

ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT CTAAGAGAAG AATTTAAAAA 2340
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15 






TGAGTTAAAA GAAACATACG ATACATTTAA AAATGGCATG TTAGAGCATA TGCAAAAAGA 2460

AGACGATGTC GATTTTCCAA AACTCATTAA ATATGAGCAA GGTGAGGTAG TAGACGATAT 2520

TAATACTGTG ATAGATGATT TAGTTTCAGA CCACATTGCA ACGGGAGAAT TGTTAGTAAA 2580

AATGAGCGAA TTAACATCTA GTTATGAACC TCCGATAGAA GCGTGTGGTA CTTGGCGACT 2640

TGTTTATCAG AGATTAAAAG CACTTGAAGT GTTAACACAT GAACACGTAC ATTTAGAGAA 2700

TCACGTATTA TTTAAAAAAG TATCATAAAT AACGCGATTA GAAACTGTTG GCAAAAATAA 2760

GTCCAGCAGT TTTTCGCTAT GTATAAAAGT CATAATAGTG ACATAAACAG CATTATTTGA 2820

AAAGAAnAAT GGTCAACTTA GCATAAAAAT TGATATGAAn ATTTAATGGT ATAGATAATT 2880

A 2881 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

ACCGTCGAAT ATCGCTTGTG ATTTACAATT TGTGTATTAA GATGCTCAAC TAATTTGGGT 60

ACATATTCCG AATTTAGATT TGCAAGTACA ACAATTCCAT AATTTTGTTT TGGATTTAGT 120

AAAATAAATG ATGAAAAGTT ATCTAGCGTT CCTGAATGAA ACACTAAATG TTCATCATTA 180

TTGGTAAACC AGCCGGAAGC ATATGCATTG GCATTAGGTT CACCAATTGT TGAAGATAAA 240

TTTTTATGTG ATTGTTGAAC TAATGATTTG TATTTATCAG GTGGATTAAG TTGGAATTTT 300

ATCGAATGTT CCAAATCTTC AGTTGATGTC ATCATATATG CTGATGGTGT ATCCCAAAGG 360

TTAAATTCAG GTTTAGAGAC GACAGGTGTC GAACCTTGTA ATTCATAGCC AATAGCATCA 420

TGTTTTGATT TGTAATTGGT TTGTTTGAAT GATGTATGTG TCATATGCAA AGGCTTGAGC 480

CATGAATTTG TAATATATTT TGTATAGGAT TGCTTCGTAA CGTTTTGGAT AATTAAACCT 540

AATAAATCAT AGTTCATATT TGAGTATTCA AATTCTTCTC CGGGCTTATG ATGTAATTCA 600

TCACCCATAA TTGCATGGGT TACATCATTT AAACGATTAT TTTTGCTTGT CACAGAATCT 660

tCGCTTGTAA TAtCACTAGG TATACCACTT GTTTGAGCCA AAAGTTGCTT AATCGTAATA 720

GTTTCATTTT GACCATTATA GTTCATTTTA AAATGAGGCA CATGTTTGGA TACGGCATCA 780

TTTAAGTTTA ATCGACCTTC TTGAGCTAAT TTTAAAATTG CAAGACCTGT GAAAGCTTTC 840
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TGATAACCAT AACCTTTATT TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 96 0 

CCAGGAATGT GTCCCTTTTG TAAATCATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 1020

TCATTGGTTA ACCGTGTCTT CGTATTGCTA TTTAAT 1056

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1277 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:
ATGCCaCACT TATTGGTGGC AGGATCGACG GGTAGTGGTA AATCTGTTTG TATAAATGGT 60

ATTATTACAA GTATTTTATT AAATGCTAAG CCGCATGAAG TTAAACTTAT GTTAATCGAT 120

CCGAAAATGG TTGAACTAAA TGTTTATAAC GGAATTCCAC ACTTATTAAT TCCGGTTGTT 180

ACAAATCCTC ATAAAGCTGC TCAAGCTTTA GAAAAAATTG TAGCTGAGAT GGAAAGACGT 240

TATGATTTAT TCCAACATTC ATCAACTAGA AACATTAAAG GTTATAACGA ATTAATCCGT 300

AAGCAAAATC AAGAATTAGA TGAGAAGCAA CCAGAATTAC CTTATATCGT TGTTATTGTA 360

GATGAGCTTG CAGATTTAAT GATGGTAGCT GGTAAAGAAG TTGAAAATGC GATTCAACGT 420

ATTACACAAA TGGCACGTGC AGCAGGTATA CATTTAATTG TAGCGACACA AAGACCTTCT 480

GTGGATGTAA TTACAGGTAT CATTAAAAAT AATATTCCAT CTAGAATAGC TTTTGCTGTG 540

AGTTCTCAAA CAGATTCAAG AACTATTATT GGTACTGGCG GCGCAGAAAA GTkACTTGGT 600

AAAGGTGACA TGTTATACGT TGGAAATGGT GACTCATCAC AAACACGTAT TCAAGGGGCG 660

TTTTTAAGTG ACCAAGAGGT GCAAGATGTT GTAAATTATG TAGTAGAACA ACAACAGGCA 720

AATTATGTAA AAGAAATGGA ACCAGATGCA CCAGTGGATA AATCGGAAAT GAAAAGTGAA 780

GATGCTTTAT ATGaTGAAGC GTATTTGTTT GTTGTTGaAC AACAAAAGGC aAGTACATCA 840

TTGTTACAAC GCCAATTTaG AATTGGtTAT AATAGAGCAT CTAGGTTGAT GGATGATTTA 900

GAACGCAATC AGGTAATCGG TCCACAAAAA GGAAGCAAGC CTAGACAAGT TTTAATAGAT 960

CTTAATAATG ACGAGGTGTA AAAAAATGTC AGAAATGAAT GCGGTATATA ACGTTAAACA 1020

ATaCATTTtA AATTTgATTA AGCAAAATAA ATTGGAATAT GGTGACCAAC TTCCAAGTAA 1080

TTTATCAATT GCCAGAGAAT TAAATGTAAA AACCGACGAT GTTTATGAAG CAATTCAGCA 1140

TTGATTACTG AACAAGTCAT TAAAGATAtT TTGAAGAGGG CACAAGTGTT AAGTCACTGC 1200
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GrtTTGAATG CGGAACT 1277 
(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA GAAATAACCT 60

TTAACAAGTA TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT TTATAATTAG 120

TTAAAATCAC TAAAACGCTT TTATACACTA TCAAATCAGC ATTTATAAAA ATATGAACCG 180

ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG CTACTTATTT 240

TTGATAATAT GGAAATCGTA ATATAAAACn AAAACTTAAT TTACTATATA AATTGTCTTA 300

ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA AAAATTAGCA 360

TTTGCAATAA CAGCAACATC TGGTGCAGCT GCATTTTTAA CGCATCATGA TGCACAAGCT 420

TCTACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCTCA AAAATACAAC 480

ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT ATTCCCTGGT 540

CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT CAAACACTTC TCCACAAGCT 600

GGTTCAGCAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT TGCTAGCAGA 660

TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA TTTAATTATG 720

CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACACC AACAGCTACA 780

ACAG&TAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC TGCTGGTCAA 840

TGTACATGGT ACGTATTTGA CCGTCGTGCT CAAGCTGGTA GTCCAATTAG CACATATTGG 900

TCAGACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTACCAAGT AAACAACACA 960

CCATCAGTTG GTTCAATTAT GCAAAGCACA CCTGGTCCAT ATGGTCATGT TGCTTATGTT 1020

GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC ATATGGTCCA 1080

TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC ATTCATCCAT 1140

TAATTAAATA AATTGTACTG ATATATACTA GCAATTCACA TCATGTGAGA TTGCTAGTTT 1200

TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT ATCTCACCCT TCCCTATCAT 1260

ACATATTTAT ATTTTGTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA CCTATTTGAT 1320
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40 






ATTTAATTTG TTATACCAGT ATTTTACGCT TTTTCGTCTA CATATACAAA TTTATATTAA 1440

ATAAAGCCCA ATACAATTTA GGTTAATTAA ACAAGTTGAT AACTATTTAA TTATTCCTTC 1500

ATTGAAGAAT ATAAACTATT AAATCATTAT TTTGCTCTTA CATATATTTT AATGACCTAA 1560

CTGaTTATGT TCCATGGAAT ACATTTATAA TATAGCCTCC TAATTAArAT GCyTTGTCTT 1620

GGTCATTCTA CGTAAATTCT ATAAAATATG TTATCTACTT ACATAArArn CTGrACTTCA 1680

ATACCACCAT ATGTTTGTGA TACTGAAGTT CAGTTTAGTT TTATTTTCAA TTAGAAAAAT 1740

AAGTTAAGTA TATAGAATAG TAAACCTGCT AACAATGCTG AAATAGGTAA TGTAATCACC 1800

CATGTAATGA TCATTCGTTG CGCAgTGCTC CATTTTACAC CTTTAGCTCG GTTAGAAGCA 1860

CCAACACCTA AGATTGATGA TGACACAACG TGAGTTGTTG ATAATGGGAA ATGTAGCGAT 1920

GATGCAACAA AAATTGTTAA TGCAGATGAT AAATCGGCCG CAGCACCATT TGCTGGACGT 1980

ATTTTCATAA TATTACCACC TACAGTTTTG ATAATTTTCC AGCCACCAAT TGCAGTACCA 2040

AGCCCCATTG cTGTCGCACA GGCAAATTTT ACCCATAACT GTGGTTCAAC ACTGCCATCA 2100

TTCTGTACAT TAGCGACAAT CAATGCCAAC GTAATAATAC CCATTGATTT TTGCGCATCA 2160

TTCGTACCGT GAGAGAATGA TTGTAACGCT GCTGTGAAAA TTTGGAAAAA TCTAAAGTTA 2220

CGATTCGCTC TTGTTAAATT TGCATTTTTA AAGATAACTT TAAAAATTGA ATACATCAAG 2280

AAACCAACAC AAAATGCGAT AATCGGTGAA ACGATTAATA CAATAATAAT TTTTGTGAAA 2340

CCTTGGTAAT GTAACACTCC AAATGAGCCT TCAGATGCGA TTGCTGCACC CGCAATTGAA 2400

CCTATAAGTG CATGTGAAGA CGAACTTGGA ATTCCGTAAA ACCAAGTAGC TAAATTCCAA 2460

ATAATAGCCG CAAGTATTGC AGCTAACACA ACAACTAATC CATTTTCCAA TTTAAATGGA 2520

TCGACAATGT CTTTAGTAAT GGTGCCTGCA ACGCCCGTAA ATGTTAAAGC ACCTATAAAG 2580

TTCATCACTG CTGCCATTAA AATTGCCGTT TTAGGgTTAA CGCTCTAGTA GATACAGCAG 2640

TAGCTACTGC ATTGGcTGTA TCATGGAaTC CcATTGATAA AGTCAAATAT CAGCGAGAAA 2700

ATAACTACAG CTATAGTGAC GATGATTATA TATGACATAA ATATATACTC CCCTTAGCTA 2760

TTTTTCATAA TAATAGTTTC AAAATTATTT GCTACGATTT GACATTTATC AGCGATTTCT 2820

TCCATGCTTT CATAAATATC TTTTATTTTA ATTAAAGTGA TTGGATCTGT TTCGCTATTG 2880

AAAATATGTT TAATTGACTG TCTTAAAATA CCATCACAGT TTGTTTCAAA TTCTTTAATA 2940

TTAATTGAAT GAATACGCAT ATGTGATAAT TTTTTATCGA CTAATAAGCC GACAGCAAGT 3000

TTCATTTCTG CAACTGCTTT TTGAATGTTA TCAACAAACT CAGCCATATA TTCATCTGTG 3060

TATTCGATTG AATACATTTC AAACATrGCT GCCGTTTCTT CAATTGCATC TAAAACATCA 3120
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TTTAAATCAG TAATTACTTG ATGTACTAAt tCGcACCATG TGACTCATAA GTTTTAATGT 3 24 0 

TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 3300

CAGCACGATC CAGATTGAAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 3360

TACTAAACAT TTAAAATCCT CCATTTAAGC GATTGTCACC AATCACATTC AGTTATAATT 3420

TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 3480

GTCAGTATAA TTTTAGTGTA CTGATATTAA TTTCAAAAAT GCCTCACAGT AAACAATTTA 3540

CTGTATTTGC CCTTATA 3557



(2) INFORMATION FOR SEQ ID NO: 258: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:
AACTATACAT TTCGAAAAAT TCTTCTAGTG AACCTGCGCC ACCAGGAGCC ATGACAAATG 60

CATCTGCAAG TTCTGCCATT TTATTTTnAC GTTCATGCAT AGAATCAACT AAAATTAATT 120

CAGTTAAACG TTGGCTTGTG ATTTCATGTT CATCTAACAT TTTAGGCATG ACGCCAATAG 180

CTTTGCCGCC ATGATCTAAT ACACCATCTT GaATGGCACC CATAATGCCA ATTGACCCTG 240

CACCAAATAC TAATTCATAA CCTTGTTCAG CAAAATATTT ACCTAAATCG TATGCTTTTT 300

GTACATATGA AGGGTCATGA CCTTTGCTTG CACCACAATA AACTGCGATT CGTTTCATGT 360

TAATCCAGCT CCTTAATTCG ATGAATGACT TTTAATAGTG ATTGTTCAAA CACTTTTTGA 420

TCTTGCTTTG TAAAAGGTGG GGGACCTTTG TGGCGACCAC CTTGTTTTCT AATTTGTGCA 480

TTCATATATC GTTTATCTAA TAGTTGTTGA ATATTTTTGG AATTGTATAT CTTCCCATTA 540

TGATGCATGA CAATTAAGAC TTTGTCGACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 600

ACTACGATAT CATCCTTCGT TGATAATTGA ACAATTTTGT AATCAACTGC ATCTGGTCCA 660

TCATCAACAT ATAATGTTGA TACATGTGGA GGATATAATT GGTTCGAAAA ATGGCTGAAG 720

CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGTTGTTA AATCTATAAT AGAATCAACA 780

ACAGGACAAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTCT GTATTGTTTT 840

CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATATTTTTTA AATGACTTGA 900

TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 960
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CGCCTTTTTT CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT 1080 

ATTTTTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTTTTCG CGGGCTTTTT 1140

GCTCTTCTTT ATGACGCTTT TCGATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200

CTTTACGATT TTGTTTAGCT AATTTTTCGC CTTTTTTCTC AATATAGGCA GGATCATGTT 1260

CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 1320

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTGCTGCAGA TGTTTTTGAA GCAACTTTAC 1380

TTGTAGCATC TGTAACTTTT TGTACGTCCG GATGTTGTTT GATACGTTTA CGTTCAACAA 1440

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT 1500

CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 1560

TAAACGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCC 1620

AATTAnAAAA A 1631

(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6645 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:






CGAAATCATG ATTTAATGCT TTTTCATATA AGCTTTTCCA ATTAATCTTT CGTCCATGAT 60

ATTCTTCAAC TGTTGCTAGA TATTGTGCAA TTTTAGTTAC TTTAAAGGAG TGTGCTGCAA 120

CaTTGTGkTC nAAATATTTA AATTTTCCaG GtAATCTTAT AAGTCTTTCC aTATCTGATA 180

ATCTtTTAAA ATATTGATGT ACACCCATTT CAATTACCTC CTCCATTAAT TAATCATAAA 240

TTATACTTTC TTTTTACATA TCAATCAATT AAATATCATT TaAATATCTT CTTTaTATAA 300

cTCTGATTAA ATGATACCAA AAAATCCtCT CAACCTGTTA CTTAAACAGG CTAAGAGGAT 360

AGTCTTGTCT TGATATATTA CTTAGTGGAT GTAATTATAT TTTCCTGGAT TTAAAATTGT 420

TCTTGAAGAT TTAACATTAA ATCCAGCATA GTTCATTTCA GAAACAGTAA TTGTTCCATT 480

AGGGTTTACA GATTCAACAA CACCAACATG TCCATATGGA CCAGCAGCTG TTTGGAAAAT 540

AGCGCCAACT TCTGGTGTTT TATCTACTTT AAATCCTGCA ACTTTTGCTG CGTAATTCCA 600

GTTATTTGCA TTGCCCCATA AACTTCCTAT ACTTCTACCT AATTGTGCAC GACGATCGAA 660

AGCATAATAT GTGCAGTTTC CATAAGCATA TAAGTTTCCT CTGTTAGCAA CTGATTTATT 720
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TACATTAACT GTCTTAGTTA CTGCTTGCTT AGGTGCTTGC TTAACTACTA CTTTTTTAGA 840

TGCTTGTTGT ACAGGTTGTT TTACTACCTT TTTAGCTTGG CTTGCTTTTC TTACTGGTGA 900

TTTAACCGCT TTAGTTTGTT TCACTTTATT TTGAGGCACA AGTGAAATCA CGTCACCAGG 960

AAAAATTAAA GGTGTTACAC CAGGATTGTA TGAATATAAT TGATTCAACG TTAAGTGATG 1020

CTCTAAAGCA ATCTTATATA ATGAATCGCC AGCAACTACT GTATAAGTTG TCGGTGATTG 1080

CGTTTGTGCT TGAACATTTG ATACATAATT ATGTTGAACA GGTGTTTTTA CTTGTGTGCC 1140

ATGTTGTTGT GCATGTGCTG CATTATTTAA AGCTAAAAAA GCTAACACTG ACGAAACCGT 1200

CACTGTAAGA GATTTTTTCA TCTTGCTGTC ATTCCTTTGC TGTTAGTATT TTAAGTATGC 1260

AAATACTATA GCACAATACA TTTTGTCAAA AGCTATTGTT ATAACGATGT AATCAAATGG 1320

TTAACAATAT AAAAAGAATA CAACCTTTTA TCATAGTGTA AAATGTATTC ATACCATGTA 1380

ATTGAGAACG TTTTCAATAA TTAATTCAAT ACCTTGAAAA TCGCCATAGG TAATATTACT 1440

AAATGCACAC TGCATATGTT GTTTTAACAA ACACAACTTT TAAAAAATAT ATTCTAACTC 1500

TATCTACCGA ATTGTACTTA AATATTCATA AACAAATCAT ATTCCAAAAT CTAATTTACA 1560

ATTTATTTAG CTACCTTTAA AAAAACCAAA AACCGACGCC CTTTTAGAGC CTCGGTTTTA 1620

AAATATATAT TAATCGTGCG ACATTGTCTG TCTTAAATAT GATTCGATAA ATGGTCCAAT 1680

GTCTCCATCC ATCACTGCAT CAACCTTACC TGTTTCTTCG TTCGTACGAT GATCTTTCAC 1740

CATTGAGTAT GGATGGAAAA CATATGATCT AATTTGGCTT CCCCAGCCGA TTTCTTTTTG 1800

TTCGCCACGA ATTTCAGCCA TTTCACGTGC CTGCTCTTCC AATTTTAATT GATATAATTT 1860

AGACTTTAAC ATTTTCATAG CTGCTTCACG GTTTTTAATT TGAGAACGTT CATTTTGGTT 1920

ATTAACAACT ATACCTGAGG GGTGGTGGGT AATTCGTATT GCCGATTCAG TTTTGTTAAT 1980

ATGCTGACCA CCTGCACCAG AAGCTCTGAA TGTATCAACT GTAATATCAT CCGGATTGAT 2040

TTCAATCTCT ATTTCATCAT TATTAAAATC TGGAATAACG TCGCATGATG CAAATGATGT 2100

ATGACGACGT CCTGATGAAT CAAATGGAGA AATTCGTACT AGTCGGTGTA CACCTTTTTC 2160

AGCTTTTAAA TAACCATAAG CATTATGCCC TTTGATGAGC AATGTTACAC TTTTAATCCC 2220

CGCTTCATCC CCAGGTAGAT AATCAACAGT TTCAACTTTA AAGCCTTTCT TCTCACAATA 2280

ACGTTGATAC ATTCTAAATA GCATATTAGC CCAATCTTGA GACTCCGTGC CACCTGCACC 2340

AGGATGTAAC TCTAGAATTG CGTTATTGGC ATCGTGAGGC CCATCTAATA ATAATTGCAA 2400

TTCGTATTCA TCCACTTTAG CCTTAAAATT AATGACCTCT TGCTCTAAGT CTTCTTTCAT 2460

TTCTTCATCA AATTCTTCTT GTAATAAATC CCAAGTAGCA TCCATGTCAT CTACTTCTGC 2520
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TTGCGCTTTC GTTTGGTTAT CCCAAAAATT AGGTTCTGCC ATCATTTCTT CATATTCTTG 2 64 0 

AATATTAGTT TCTTTGTTCT CTAAGTCAAA GAGACCCCCT AATTTGTGTT AAATCTTGAT 2700

TATACTTATC TATATTTCGT TTGATTTCTG ATAATTCCAT AGcATTCGCT CCTATTTATA 2760

TTTCAATTCA AGTCATTGAT TTGCATCTTT TATAATGCTA AATTTTAACA TAATTTTGTT 2820

AAATAACAAT GTTAAGAAAT ATAAGCACAC TGACAATTAG TTTATGCATT TATTGTTAAA 2880

AATCAGTACA TTTATCATCG ACATATGCCT AAACCGATTT TTTAAAACTA AGTACATAAC 2940

AACGTTTAAC AACTTCTTCA CATTTTTTAA AGTATTTAAC GCTTGTAAAA TAAAAAGACT 3000

CCTCCCATAA CACAAACTAT AGGTGTTTAA TTGGAAGGAG TTATTTTATA TCATTTATTT 3060

TCCATGGCAA TTTTTGAATT TTTTACCACT ACCACATGGA CAATCATCGT TACGACCAAC 3120

TTGATCGCCT TTAACGATTG GTTTCGGTTT CACTTTTTCT TTACCATCTT CAGCTGAAAC 3180

GTGcTTCGCT TCACCAAACT CTGTTGTTTT TTCACGTTCA ATATTATCTT CAACTTGTAC 3240

TACAGATTTT AAAATGAATT TACAAGTATC TTCTTCAATA TTTTGCATCA TGATATCAAA 3300

TAATTCATGA CCTTCATTTT GATAGTCACG TAATGGATTT TGTTGTGCAT AAGAACGTAA 3360

GTGAATACCT TGACGTAATT GATCCATTGT GTCGATATGA TCAGTCCAAT GGCTATCAAT 3420

AGAACGAaGT AAAATCATAC GCTCAAACTC ATTCATTTGT TCTTCTAAGA TATCTTTTTG 3480

ACTTTGATAT GCTGCTTCAA TCTTAGCCCA AACGACTTCG AAAATATCTT CAGCATCTTT 3540

ACCTTTGATA TCATCCTCTG TAATGTCACC TTCTTGTAAG AAGATGTCAT TAATGTAGTC 3600

GATGAATGGT TGATATTCAG GCTCGTCATC TGCTGTATTA ATATAGTAAT TGATACTACG 3660

TTGTAACGTT GAACGTAGCA TTGCATCTAC AACTTGAGAG CTGTCTTCTT CATCAATAAT 3720

ACTATTTCTT TCGTTATAGA TAATTTCACG TTGTTTACGT AATACTTCAT CGTATTCTAA 3780

GATAjCGTTTA CGCGCGTCGA AGTTATTACC TTCTACACGT TTTTGTGCTG ATTCTACAGC 3840

TCTTGATACC ATTTTTGATT CAATTGGTGT AGAGTCATCT AAACCTAGTC GGCTCATCAT 3900

TTTCTGTAAA CGTTCAGAAC CAAAACGAAT CATTAATTCA TCTTGTAATG ATAAATAGAA 3960

GCGACTATCC CCTTTATCAC CTTGACGTCC AGAACGACCA CGTAACTGGT CATCAATACG 4020

ACGAGATTCA TGTCGCTCTG TACCTATTAC TGCTAAACCG CCTAATTCCT CTACGCCTTC 4080

ACCTAATTTG ATATCTGTAC CACGACCAGC CATGTTAGTG GCAATAGTAA CGGCACCTTT 4140

TTGTCCAGCG CCTGCAACAA TTTCAGCTTC ACGTTCATGA TTTTTCGCAT TTAACACATC 4200

ATGACGGATA CCACGTTTTT TAAGTAAATT TGAAATATAT TCAGAAGTCT CAACTGCAAC 4260

AGTACCTAAT AGCACTGGTT GCCCTGCCTT GTGTTTTTCA ACAACATCTT CTACTACTGC 4320
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TTTATTTGTC GGAATTTGAG TTACTGTCAT GTTATAAATA TTTCTAAATT CTTCTTCTTC 4440

AGTTTTAGCT GTACCTGTCA TACCCGCAAG TTTATTGTAC ATTCTGAAAT AGTTTTGGAA 4500

TGTAATAGAC GCCATAGTTT TAGATTCATT TTGAATTTGA ACGCCTTCCT TCGCTTCAAT 4560

AGCTTGGTGT AAACCTTCCG AGAAACGACG GCCTGGCATT GTACGTCCTG TAAATTGATC 4620

GACAATTAAT ACTTCGCCAT CAACAACCAT ATAGTCTACG TCACGTTGTA ATGTAACGTG 4680

CGCACGTAAA GCTGTGTTGA TATGACTAAT AACATCAACA TTTTGTACAT CATATAAGTT 4740

TTCAACTTTG AACATACGTT CAGCTTTATC CGCACCTTGT TCTGTTAAAT GTACAGCTTT 4800

CGTTTTTTCA TCGTATTTAT AATCTTCGTC CTGTTTTAAC ATTTTCGCAA AAACATTTGC 4860

TTGTGTATAA AGTGACGTTG ACTTTTCAGC TTCACCAGAA ATAATTAATG GCGTACGTGC 4920

CTCGTCGATT AAAATTGAGT CAACCTCATC AATGATTGCA AAATGTAATG GACGCATTAC 4980

TCTATCTTCA GAATAATTCA CCATGTTATC TCGTAAGTAA TCAAAACCTA GCTCATTATT 5040

AGTACTGTAA GTAATGTCTT GTGCGTATGC TTCACGTTTT TCTTCTGTCG TCTTACTGTT 5100

TAAGTTTAAT CCGACAGTCA AACCTAAGAA GTTATATAAC TCAGCCATTT CTTCACTTTG 5160

AACACTTGAT AAGTATTCAT TGACTGTAAT AACGTGAACA CCTCTACCAG CTAATGCATT 5220

TAAGTATGTT GGCATTGTCG CTGTTAATGT TTTACCTTCA CCTGTTCTCA TCTCAGCGAT 5280

ATCACCTTTA TGAATTGCAA TACCACCCAT AATTTGAACT TTATATGGTG TCATATTGAA 5340

TACACGTTTA GAGCCTTCTC TAACAAGTGC ATATGCTTCT GGTAAAATTT TATCTAAATA 5400

ATCATTTTGC TTTTTGACAT TATCAATGTC AGCTAATTCT GTTTGGAATT GTTTCGTTTT 5460

ATTACGAATT TCTTCATCAG TTAAAATTGC CGTTTTTTCT TCTAAAGCGA TTACTTTATC 5520

AGCAAGTTTA CCTAACTGTT TAATTTCTTT ATTATTGCCA TCAAGAATTT TTGATAAAAA 5580

TCCCftTTTCG TTCGCTCCTT TAGCTAAAAA ACTGTTTGGC CTACAACAAT ATATCTTATC 5640

ATTTATAGTT AGAAAATTAT ACTTATTTAC TCATTTGTAG AATCAATATA AATATATTTA 5700

TGACATACTT CATTCACATT CTGTTGTCAA CAAGTTTATC ACTAATAAAT ATATTCTCAA 5760

TACGCAATTA TACTTCCTAA TAAATTATAT TATAAATATT TTACGATTTT CGACTCGGAC 5820

TATACAATAG ACTGACATAC TATTATTAAC TTAACATTCA AATATATACA TCCATTAACA 5880

TTAGCATAGT CACTATGTTT CATTCAACAA ATTACATTAT CGAACTATGA AATAGTCATA 5940

ATTTGCTTTT GGAGTATAAA AAAGCACTTG TGCAAAAACA CAAGTGCTTT AAACTTAATT 6000

TATTGTTCAC TAGTTTGAAT CAAGCCATAT TTACCGTCTT TACGGCGGTA AACGATACTT 6060

GTTCCATCAG TTTCTCTGTC TGTGAATACA AAGAAGTCAT GACCTAATAG ATTCATTTGT 6120
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w 






ATCTCGTTAT CATCGTAAGC G7CATTATCA ACTTGTGTTT CTTGCATTTC TTGTAATTCG 6240

GCAACAAACA CTTCTTGATC TCCTCGATCA CGGCTCTTAC GATTAATACG TGTTTTATAT 6300

TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 6360

TTTCGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA TTGTTACTTC AATTTTAGTA 6420

GCTGAATTTG AATAAGTTTT AACTTTAACA TGCGCCACTG CATTTGGTAC GTCATTAAAA 6480

TAACGTTCCA ACTTACCAAT TTTTTCClcCA ATATAGTTGC GAATAGCATC TGTGATAGTG 6540

AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCTTAAA CCTCTTTATn 6600

GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GCAAA 6645
(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7430 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:

CAGTTCCAGC ACAtCTATTG GGGATCAACA AACTAGGGAA AATGCTAATT ATCAACGTGA 60

AAACGGTGTT GACGAACAGC AACATACTGA AAATTTAACT AAGAACTTGC ATAATGATAA 120

AACAATATCA GAAGAAAATC ATCGTAAAAC AGATGATTTG AATAAAGATC AACTAAAGGA 180

TGATAAAAAA TCATCGCTTA ATAATAAAAA TATTCAACGT GATACAACAA AAAATAACAA 240

TGCTAATCCT AGCGATGTAA ATCAAGGGTT AGAACAGGCT ATTAATGATG GTAAACAAAG 300

TAAAGTGGCG TCACAGCAAC AGTCAAAAGA GGCAGATAAT AGTCAAGATT CAAACGCTAA 360

TAACAATCTA CCTTCACAAA GTCGAATAAA GGAAGCACCA TCATTAAATA AGTTAGATCA 420

AACAAGTCAA CGAGAAATTG TTAATGAGAC AGAAATAGAG AAAGTACAAC CACAACAAAA 480

TAATCAAGCG AATGATAAAA TTACTAACTA CAATTTTAAC AATGAACAAG AAGTGAAaCC 540

TCAAAAAGAC GAAAAAACAC TATCAGTTTC AGATTTAAAA AACAATCAAA AATCACCAGT 600

AGAACCAACA AAGGACAATG ACAAGAAAAA TGGATTAAAT TTATTAAAAA GTAGTGCAGT 660

AGCAACGTTA CCAAACAAAG GGACAAAGGA ACTTACTGCA AAAGCGAAAG ATGATCAAAC 720

GAATAAAGTT GCCAAACAAG GGCAGTATAA AAATCAGGAT CCTATCGTTT TAGTGCATGG 780

TTTCAATGGG TTTACAGATG ATATTAATCC TTCAGTGTTA GCTCATTATT GGGGCGGTAA 840

TAAAATGAAC ATTCGCCAAG ATTTAGAAGA AAATGGTTAC AAAGCTTATG AAGCAAGTAT 900
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TCGTGTAGAT TATGGTGCAG CACATGCAGC AAAATATGGA CATGAACGTT ATGGAAAAAC 1020

ATACGAaGGA ATTTACAAAG ACTGGAAACC AGGACAGAAG GTACACCTAG TTGGACATAG 1080

TATGGGCGGT CAAACGATAC GTCAACTAGA AGAATTACTG CGTAATGGTA ATCGTGAAGA 1140

AATAGAGTAT CAAAAGAAAC ATGGTGGcGA AATTTCTCCA CTATTCAAAG GTAATCATGA 1200

CAATATGATT TCATCAATTA CTACTTTAGG AACACCACAT AATGGTACAC ACGCATCAGA 1260

TTTAGCTGGT AATGAAGCTT TAGTGAGACA AATCGTATTT GATATCGGTA AAATGTTTGG 1320

TAATAAAAAT TCAAGAGTAG ACTTCGGGTT GGCTCAATGG GGTCTAAAAC AGAAGCCAAA 1380

TGAATCATAT ATTGATTATG TCAAACGCGT TAAACAATCT AATTTATGGA AATCAAAAGA 1440

TAATGGATTT TACGATCTGA CGCGTGAGGG TGCAnCAGAT TTAAATCGTA AAACGTCGTT 1500

GAACCCTAAC ATTGTGTATA AAACATACAC TGGTGAAGCA ACGCACAAAG CATTAAATAG 1560

CGATAGACAA AAAGCAGACT TAAATATGTT TTTCCCATTT GTGATTACTG GTAACTTAAT 1620

CGGTAAAGCT ACTGAAAAAG AATGGCGAGA AAACGATGGT TTAGTATCCG TTATTTCTTC 1680

TCAACATCCA TTTAATCAAG CTTATACAAA AGCGACAGAT AAAATTCAAA AAGGCATTTG 1740

GCAAGTGACG CCTACAAAAC ATGATTGGGA TCATGTTGAC TTTGTAGGAC AAGACAGTTC 1800

TGATACAGTG CGCACAAGAG AAGAATTACA AGATTTTTGG CATCATTTAG CAGACGATTT 1860

AGTGAAAACT GAAAAGCTGA CTGATACTAA GCAAGCATAA TTTATAAAGT AAAGGGAGGA 1920

ATTAATAATG ACTGCAGACT TCTTTCAATT AATCGGATCA TTATTTAGAA TTCTAAAAGA 1980

ATTATTCAAG TAAAACATTG GCGAGGCCCC AACATAAAGA ATTTCGAAAA GAAATTCTAC 2040

AAACAATGCA AGTTGGCGGG GCCCCAACAA AGAAGCTGGC GGAAAGTCAG CTTACAATAA 2100

TGTGCAAGTT GGCGGGGCCC CAACATAGAA GCTGGCGGAA AGTCAGGTTA CAATAATGTG 2160

CAAGJTGGGG TGGGACGACG AAATAAATTT TGCGAAAATA TCATTTCTGT CCCACTCCCA 2220

TTGGCATTTA CGAAGTTTAA ATGTGCAATT AGAATATATG TATAACAATA TTAAACACGC 2280

GGTAAAACGA AGTCAGTCAA TTCAAACTGA TTTCGCCCAC CGCGTGTTTT TAACATAGCT 2340

TAATAATTAA TAAGCATTAA TGTTCAATTT ATATGGTTGT TTTCCAATAA TAAACCTAAA 2400

GATATAGAAT TCACGCAATA TCATGCCGAC ACCTATACAT AATCCTAAAA TGAATAGTAG 2460

TGATATCGCT AGAAAGACCA TTGTATTATC CTCAAATATA TTTGTATATG CAAACAATGA 2520

GTCTAGAATG ATTGGATGTA ATAAATAAAT AAAGAATGAG AAAGCACTAA TCATTTGAAT 2580

CGTATTAAAT AACATTGTTT TAAAATGCGT GCAAATACCC AAGATAACAA TAAACATAAT 2640

ACTATTATAT GGTGTTAATG AATATGAAAA GCTGGTAACG TTCCAATAGT CTCCaTTTGT 2700
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TCTAAGAAAT TTAATACACG TTCGTAGTTA T AA C C CAT AT ATGCACCTAA GAAGAAATAA 2 82 0 

AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATGGAT AATAGTGTAG CACGGTATCG 2880

TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 2940

AATAATAATA TTTTACTGTT GAATAGGTTA TAGTTAATTT TAAAAATGAT ATAACTCAAA 3000

ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATTT 3060

TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAATG ATTCACTATA ACTGTAAAAC 3120

AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACGC GTGTAGTTAA GTATCTATAG 3180

GTGACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 3240

CCAAAAATCA CAATATTACG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 3300

TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGC ACATATAATA 3360

GCACGTAAAT ATACGAGTTC AAGTCTAATC TTTTTCATGG AATCCGTCCC ATCTCTTAAT 3420

TAAATGCTCA AAAGCATCAT CACTAATTAA TATTCTAGGG ATGTAATAAT CATTGGAGTT 3480

CGGAGTGACT GCTTTTTCCT CTAATGAAAA ACCGTATTTT AACCCAGCTT TTTTGATTAC 3540

CGGTAATTTA TCGTCATTCA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTTTTT 3600

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAGCTTCAGA 3660

AGCTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 3720

TTCAAATTCC CATAACCCAG TTTTATACAT TTCTTTTAGT TCTTTTTTAC TAATCATATC 3780

GAGGTTGTGA AAGTTTTCTT CCCCAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 3840

TTTATATTTT TTTAAGATTG GATAAGCATT TTCATAAATA GTTTCATCCA TATCATCAAA 3900

GTTAATCCAT ACACTTCGTT TTGGaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 3960

CAAQGTTAAA AATTTAGCAT CATGTGATTT TAGCCATTTT ATTTGAGATT CAAATTGTGA 4020

TTGACTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 4080

CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 4140

TTTCAGTTTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA TGATGACCAT CCAGTGTGCT 4200

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACGAT 4260

TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 4320

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 4380

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 4440

AGTAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAG TGCTGTTTCT 4500
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TTGCGTAACC ACCTTTCTTA CGTTTTAATG CTTTTGGAAA TGCGACAAGA ACTACTGCTG 462 0 

CGTTAATAAT CCAGTATACT GTCGGATACC AACTTACAAA TATGAGTCCA GCCATATTCT 468 0 

5 TTTTCTCGTA GCGACTATCA ATAAAGAGTG CGACTGTAAA TTGAATAACG TTTATAAAAG 4 74 0 

T CAT AGTAAA TGATGATAGT AGAAATATTG AAAAACTATA TGTCATAAAT GTATAGTCTA 4 800 

AGAAGTTTGC TGTTATGAAC AAATAGCCTA AATATAGAAG CACTATATAT ACCCATAAAA 4 86 0 

10 

TCGAGATGAT TTGCTCAAAC ATCAAAATAT ATAAAGGAAA CCTTTTCGTT TTCATTGTGC 4 92 0 

TAAAAAAGTC TCGTAGTAAT ACTTCGTGTC CCCCTTGAGC CCATCTCACG CGTTGCTTCC 4 980 

AAAGACCTCC CAATGTTTCT GGAACCAACA TCCAACACAT GGCAAGCGGT TCATACTTAA 5 04 0 

IS 

TACGATATCC ACGTAAATGC AATTTCCAAG AAACTGCAAT ATCTTCGGTA ATCATATCAG 5100 

TATCCCAGTA GCCAACGTCG ACAACTGCAC TTTTTTTAAA TAGAGTGAAG ACACCCGAAA 516 0 

„ Q TAGTATTGAC TGCGCCAGCA AGTGTCTGAC TtCGCTTAAT ACAGCCAATT AAACTTGCAT 522 0 

ATTCTATCGT TTGAATTTTA CCTAAAATAG AACTCTTATT TCGAATTCTA GGATTACCTG 52 8 0 

TAACTGCACC AAGTTTTGGA TCATGTTTGA AATTCTCAAT CATATAATAT GGTGCATCTT 534 0 

25 GATCAACGAT AGTATCTGCA TCCAAGCACA TTACATAATC ATATGAAGCC TGTTTAATGC 54 0 0 

CTTGATTGAG TGCGTTGGCT TTACCTCTGT TTTCTTGTAA ATCGACGAAA ATAAAGTCAT 54 6 0 

TATTTTCTTT GATTTTATAG ATGAGTTCTG CTGTATTATC TGAACTTCCA TCATTAATGA 5 52 0 

30 TAATAATTTC TTTCTTCTCG TATTTGAGTG CAAGAACATT AGACAACGTA TCTTCAAyCG 558 0 

TTTCACTTTC GTTATAACAG GCAAGTAAAA ATGTAATGCC TTCTAATTCA TCCACATTTA 564 0 

TGTCAGGCTT CTTGTTCAAT GAATATCTAA TTTCTCTGGT AAAATAGAAA TAAATTGAAC 5700 

35 CGACAATCCA GTAAATAGAC ATAAATACAG GATAAAAAAG CAAAAAGTTA AAAAATTGCA 5760 

ATTTCTTTAC CTACCTTTCG TTAGTTAGGT TGTAAGCCAT ATGGTAATTG ATAGTATTTT 5 82 0 

AATTTGCAAT AGATTGTTGT TATAATTAAA CGGAAATATT TGTAATTGCA ACTTAATTTT 58 80 

CCTGTAACAT AGTGTGATTA ATTTTCAGTA GGGGGTTATA AAAATTGAAG GATAAGATTA 594 0 

TTGATAACGC AATAACCTTA TTTTCAGAGA AGGGGTATGA CGGTACAACA CTTGATGATA 6 0 00 

TAG CTAAAAG TGTAAATATA AAGAAAGCGA GTTTATATTA CCATTTTGAC TCGAAAAAAA 6060 

GTATTTACGA ACAAAGTGTT AAATGTTGTT TTGATTACCT TAATAATATT ATTATGATGA 6120 

ATCAAAATAA ATCGAACTAT TCAATTGATG CTTTATATCA ATTCTTATTT GAGTTTATTT 6180 

TCGACATCGA AGAAAGGTAT ATTAGAATGT ACGTTCAATT ATCTAATACG CCTGAGGAAT 624 0 

T TT CTGG AAA TATTTACGGA CAAATACAAG ATTTAAATCA ATCATTAAGT AAAGAGATAG 63 00 


EP0 786 519 A2 



TGCTGTTTCT TGAAAGTTGG TATTTGAAAG CATCCTTTTC GCAAAAATTT GGAGCAGTGG 64 2 0 

AAGAAAGTAA AAGTCAATTC AAAGATGAAG TGTATTCGCT ACTAAATATA TTTTTGAAGA 64 80 

5 AATAATTTTT GTTACTAGTT TGTAATAATT AACTTACTTT TGTAAAAAAA GACATGAGAT 6 54 0 

TATTTTTTTA AATCTATATA AAGTTGACAA TACAAATCGA TATTGAGAAT ATTAAGATGT 66 00 

ATATGAATTT TATAAATTAA ATGCAATACA TTAATATAAA TATCAATTGT TGCAAAATAC 6660 

GATTTGTTCA ATGATTTGAT AATATTATTC TTTATATTTG TGAATGGTTA AGTTTGTCTT 6 72 0 

TGAACATATT ATAAAAGTGT AATGTTCCCC TGAAAAGAAT AAGTTGTCAT CTAATTACAG 6780 

GAAATCCGCA TAAATTAGAT GAAATGGAAA GTAATAAGTA ATAATTTATT GATAAGCGCC 6840

TATGTGATGG TAAATCATGA CATAGGCGCT TTTTTTTATA AGTTAAAAAT GTAAATAAAA 6900 

ATTATATAAA TTACCCACAT CTTTTTAAAA GGTGTGGGCT TTATTATCAT TAACCCAACT 6 96 0 

20 CACAGTGACG GGTTACG CAA GGTATTGAAT TACCGAGTAC GGGCACGCTC GGTGTTGTAA 7 020 

AG AG CAAAT A ATCAAGTAAT GATGATGCTT CTACTCGATT ATAAGAAAGC CATGATAGAG 7 0 80 

TACGATGGTA TCTAGTTTTA TTATTAATAG GTTTGGATAT TTAAAGTTGG ACAATATTAT 714 0 

ATCTTGTGCA AAAATATAAA TAAGTTATAC ATAATGGTAG AGAATCATGA TATAATTTTA 72 00 

AACGATAAAA TATTTATATA AATAATTAGA GaAAATGTAG TTGTGTATGT yTTGTGGtCG 72 60 

TTAAACTAGA TATAATTGTC CGATTTATAA AACATACATA ATGAATACaA TGATTGATTA 73 2 0 

TGTGGAGGAA ACCATGAmAG AmAAGTTTgA TTTAGTAAAA CTATTAAATA TTCTAAAGAA 73 80 

GAATATTAAA TTATTGCTTA TTTTACCGGC AATATGTCTT GTAGTAAGTG 74 3 0 

(2) INFORMATION FOR SEQ ID NO: 261:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4082 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

ATTGTTACTC ATTATTTTTT CATATTCACA CAAATGATCT TGTTTATATT TAGCTAATTG 60

ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTGTAAC CTTTAATAAT TTATTTTCTA 120

TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 180

AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 240

TTGAATCGGT GTCTTCACAT TAAACGGTAA gGAknAAAAT AATTTCACGC CTTTTGGTGT 300
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AGGCGTCTGA ATAATTTGCA ATAAAAATGC 
TATGATTGCG ATAAATAATA TTGAATGCGT 420

AAAGGTTCGA TGTCCAAAGA TCAATCTCAC AAAAAAACTA ATTACCTTAA ACCTTCTGCC 480

AATCTTACTT TGAGTGTGAC ATATATCGGG TAATAAGCTA GCTAGAGTTG CTAGAATGAT 540

AACCGTAACC GACGAAAAAA TATCCGTTTG AAAATATTGT GTTGTTAGCG CTCCAACGAG 600

CATCCCGCAT GAAGCATGTG TTTTACCTGT CATATTTGTT CTCCTTTAAT ACTCACATTT 660

TACCACATCC CTAACAAAAA CACGAACATA TTTTCGGGTT AAAATTCATT AGTATGACAC 720

AATTTAAAAA AGTATCACAT AACTCTTGAA AACGATTACA AAATCGTTTA TGATGTATTT 780

ACAAAATATT TAAAGGATGT GTTTGAATAA TGGCAATGAC AGTAAAAAAG GATAATAATG 840

AAGTGCGTAT TCAATGGAGA GTTGCTGATA TCAAAATTCC TACAAGTGAA ATTAAAAATA 900

TTACACAAGA CCAAGATATT CATGCAGTTC CTAAATTAGA CAGCAAAGAT GTATCTAGAA 960

TCGGCTCAAC GTTTGGTAAA ACGAATCGCG TTATTATCGA TACTGAAGAC CACGAATACA 1020

TTATTTATAC TCAAAATGAT CAAAAGGTTT ACAATGAATT AACTAAATAA ATTGTATAAA 1080

AaAATCATTC ATGGTGAGGG CTTCATGAAT GATTTTTTTA ATTGATTCAA CACCCAGCAT 1140

AAACAAATAC AAAAGGACAA CTGTTCCCAT AATTTTAACA GTTGTCCTTT TTCACATATA 1200

TTTATAACAA AAGATGTGCC ATCAAAGAAA TAATTGGTAG TGTAATGATT GTTCTAATCA 1260

AGAAAATCAT AAACAATTTG CCGATGCTTA CAGGAATCTT CGAACCAAGT ATGACGCCAC 1320

CTACTTCAGA CAAGTATATT AACTGCGATA TACTAAGTGC CCCAATAACA AAACGAGTTA 1380

TATCATTTTG TACACCTTCA ATTAATATAG AAGGTAAAAA CATATCGGCA AAACCGATAA 1440

TAATCGTTTG AGAAGCCTGT GCCGCTTCAG GTATTTGCAT TAACTCTAAA AATGGAACAA 1500

AAGGTTTACC CAATATGACA AAAAAGGGCG TGTAGTTCGC AATAATGGTA GCAATAGTAC 1560

CAATACTCAT TACTACAGGC AAAATAACAA ACCACATATC AATGACTGTT TTTAATCCTG 1620

ACTTAAAAAA GTCAATAACG CCCGGTGCTT TAATACCTAC TTCTGTTGCA GTATCAAAGC 1680

CATGTCTCAA TGCCGTCTTT CCTTCTGGCA ATGCCTCAGT ACGCGCACTT TCAGGTACCT 1740

CCTTAGCATA CTCATCAGGA ATTTTATTTA AAGGCCAAAT TCTTGGCATA ATGACTGCTG 1800

CAACGAGGCA GGATACTATC ACTGATAAAT AGAAAGCAAA AAATTGATTT TGCATGTGCA 1860

CTGTTTCAGC AACTACAATT GCAAAGGTGA TAGAAACTAC ACTAAATGTC GTTGAAATAA 1920

CTGTTGCCTC ACGACGAGAA TAATATCCTT CACCATATTG TCTACTTGTA ATTAAGACAC 1980

CAACAGTTCC GTCTCCAATA AATGATGCTA AATTATCTAC CGTCGAACGT CCTGGCAATG 2040

TAAATAAAGG TCTCATAACC GGTCTAAAAA TAGGACCCAA CATCTCTAAC AAACCGTATT 2100

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AACTTGAGAA CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA C CAATTCGT A 2 22 0 

AAAATGTCAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA CCAATTCTAA 2280 

5 CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA ATAGTTGAAC 2 34 0 

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATTAAAAAT GGCATTACGC 2400 

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 24 60 

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAATAATAA GATACCAATT AAAGACATCG 2 52 0 

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 2580

TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAATT CTATTCAGTT AACAAACATA 2640

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTACAGTT 270 0 

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760 

20 TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2 820 

CGTAATAATA TTAAGAAGTA TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 2880 

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCACCA 2 94 0 

25 ATTAACCCCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TCTCGCAATA 3 0 00 

TGAGGTGCAA TTAATCCTAA AAAG CTAAT A CCACCGACAA CTGAAATTGC GGATCCTGCT 3 060 

AATATTACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 312 0 

GTTGCAACAG CATCACCTAG ATTCAATACA TCTAATTGAT AACTCAATAA AATGATGATT 3180 

GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGACATATC ATGTCCATAT 3 24 0 

35 AG ACTAC CTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG 3 3 00 

AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360 

CCTTTAACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 34 2 0 

40 AAGAATGAAC CTATAGGTAA TACAAACAAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 34 80 

ACAGCAGCTA AACTG GCACC TTTAGAAATA CCTATAACAT CAGGTGAGGC TAACGGGTTT 3540 

CTTATTACAG CTTGTATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3600 

45 AGTAATGTTC TAGGTATACG ATACTCATTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3660 

TTAATAGCAT CAATCGGATG AATCATGACA GACCCTACAC ATAAACTTAT GAATATACTC 3 720 

ACAATTAAAA GGATTGTGAT TAAACTATAA CGACGTATAA TTTTCGTTGT CATCATATTC 3 7 80 

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC ACCTACGAAT GATGTAACAA 3 84 0 

TCCCTACTGG TGATTCATAA GGATATGTAA TTAAACGACT TAATACATCT GATAGTAGTA 3 900 
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AGCGTTTGAC TATATGCGGT ACGATTAAGC CAACAAATCC AATTGGTCCT GCCACTGACA 4 020 

CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATGAT AATTCTAACT TTATTTATAT 4 0 80 

XT 4082 
(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1145 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:



TATTAGAAGG TCGTTCGGAT GAACAATTAA AAAATTTAGT TAGCGAAGTA ACTGACGCCG 60

TAGAAAAAAC AACGGGGGCA AATAGACAAG CAATTCACGT TGTTATAGAA GAAATGAAAC 120

CAAACCATTA TGGTGTGGCT GGCGTAAGAA AGTCAGATCA ATAATTCTTC ATAAGATGCA 180

TGCCAATTAA TTCTTTGAAA ACGAACAAGG CGACTTCTAT CTGaGTATGA TAGAAATCGC 240

CTTGTTTATT TTTAATCTTC ATCTAAAAAG TCTTTAATAG CTTGTTTATT TGTGTTTTTA 300

ATGCACTACC ATCAGTATTT GTATTGACAT CyTCGTATGA GTTCTTGATT 360

GGCaCAGTCA ATGACTTAAC ATCTTTTTCA CCTCGGATAC CaAAACTCAA ACCTGTTTGG 420

AAAATCCCTG AATCAGGAAT GTTTGTATTC ACATAGCCTC TTAAAATACC TGCAACTTTT 480

GGTAATTTAA CAACTGTTCT AAAATTAACC ATTTCTTTTT TCAATGTTTG CATCACTTGT 540

TGCTGACGTC GCACGCGTCC GAAGTCACCT TCAGGGTCGT GACGGAATCT TGCATAACCA 600

AGTAATTCTT TACCATTCAA CCTATGGTTA CCCTTTTTCA AAGATACACC AATATTTTTC 660

GAGffTATCTT TTTCGACATT AATTGGTACA CCTTCAGGCA TTAATTCATC AATCATTTTC 720

TCAAATCCAG TAAAATCAAC TACTGCATAA TATTCAGGAT TAATTCCTAA ATTTTTATCA 780

AGTGTTTTTC TAAGTAGCTC TGGACCACCT AAAGCGTATG CTGAATTAAT TTTGTGTTTT 840

CCATATCCTG GAATATCTGC ATAAATATCA CGCATGACAG ACATCATTTT CATCTTTTTA 900

TTGATAAAGT CATATTGAAC AACCATGATA GAATCTGTTC TTGATTGTCC ACCTTGTGCT 960

TTATCTGCAC CGAGTACAAG AATAGAAATT nTACCATCAT TTTTTACTGG TCCATTAAAT 1020

TGATGTACTT TAACATCTTT CGCATGTTTC TTGGCATATT CTACACCGCT ATTGTAACTA 1080

TGTACAATAT ATACAACTAA TGCCGATAAG TAAAATTACA ACAATCAGAA GAATGATAGG 1140

TAATT 1145

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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263:

TATGGCTCAT CATTAATGCA CGTATCGGGT AGCGTTTACC ATTGATAAGT GCTTCATGTT 60

TAGCACGAGT TCTTAAAATT CCATCGCCAT AACCGATATC AACTACAGCT AATTTTGTAT 120

TGTTTTTAGT CACTTCAAAG GCAAAGCTAT AACCGCAATA ATCACCAGCT TGTACTTCGC 180

GCACTTGAAT AACATGTGCT TTTAAAGTTA ATGACTGAAC TATATCATGT TGATTCAGTG 240

AACTATATGG TCTTGAACCG TATAACGCAA TACCTACACG CGCATGTGTA TGGTGGGGTA 300

GTAATAyyyg TCCTTCCCGA TAAAAACTCG CACTATTTTG AGCATGGATT AGGTCGAACT 360

GATAACCTTC AGATAAAAGT GCTTCAACAA TTTCCATCCA TTGTGAACGT TCAACATTAT 420

AAtCTGACAC ATCGAATTCA TCAGCATATC CAAAATGGGk CCATAAACCA CTAATAATCA 480

TTTTTGCATT TTGATTATGA TGGTGATCTT TCAATACTTC TTTAATTTCG TTTAAATCTT 540

TAAATCCAGA CCGATGTAAT AAATTTTCAA ATTCTAAGTG AACATGAATA CCAGCTAAAT 600

CATTTTTATG GTTATAGTAA TATGTCAACG ACGGCAAAGT CATGTGTATT TGATGTTCAC 660

GGACTAAATC AAACTCGTAA ACTGCATTCA TTAAAAAGAT TGTTGCATCT GGAGCAAGTT 720

GTCTAATTTG AATTGCTTCT CGTAGTGATG TTGTGCTAAA TGTATCTATA CCTGCATGGA 780

TAAACTGAGT TACAGCAAAT TCTAGGTCAT AGTGATATGC ATTaTTTTTA ACAACTGCCA 840

TTAATGGCTG ATTGTTTTTG ACTGTGATTG CATTTTGTAA AAATATTTTC TTATTTACAG 900

ACCATGTTGC TGTCAATGTA TTACACCTCT TTGTAATTAT TTAATAAATT TTCGTAAAAA 960

TTAACCACGT TTATTAACAC TTTTTCATCA AAATTTAAAT GTGATGTGTG CAAACCAGTT 1020

ACAAAACCTT TATCTTCATT TCGTGTTCCT ATAAAAACAA AGTAAGCTGG AGCTAGTTGT 1080

TGACCATAAA AACTAAAATC TTCCCCAAAT AAGAATGGCG TTGGTTTGTC ATAGACATTT 1140

AAATCAGCTT TTATTAAGGC GTCCTCTATT TGAGTACGTA ATTTCGGACT ATTGATTGTA 1200

GGGGGATAAC CTTCTGCAAA TTTAACTTCA CAATCTACAT TAAACAGAAG CTTGACACTT 1260

TCTGCTATCT TGTGCATTTG ATTTTTAACG ATTGTTAAAT CATCAATATC ATATGTACGA 1320

ATAGTACCTT CTAAATAGCC ATTACTTGGT ACAGTGTTAA TCGCTTCACC AGCTTTAAAA 1380

TGACCAATAT GAACAATATT TCGTTTCAAA CCGTTAAGGT GAAATTGTTG AATTTGTGAT 1440

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ACATGACTTG ACAGGCCTGT TAAGAAAAAG CGATACTCTG TTGCGCTGGC CGTAATTTCT 1560 

TCATCTCTTA TCACTGCAAT GCCTTCATCA GCAAATGGGT TAACATGAAT ACCAAATACC 1620 

GCTTCAATTG GATACTTATC AAAGGCACCG GCTTTTATTA ATCGATTTGC ACCGCCACCA 1680 

GTTTCTTCTG CAGGTTGGAA AATGAAAACG ACATTTTGCG GTAATTGACC TGCATCTTGC 174 0 

ATGTCTTTGC AACGTTGTAC AAAAAGCATT AATGCAGTTG TATGACCATC ATGTCCACAA 180 0 

GCATGCATCA CATGATCAGA TTGACTGCGA TAAGGCACAT CATTTTCCTC TAAAATAGGT 18 6 0 

AACGCATCAA TATCAGCTCT ATACGCTATC GTATGTGAGC CATTACCTTC TAAGTATGCA 192 0 

15 ATGACGCCAG TTTCCAATGG GCAATCGTAT TTAATATTTA AACTATCTAA AAACGCTTTA 1980 

ATATAAGCAG TTGTTTCAAA TTCATGTAAG CTTAATTCAG GATGTTGATG TAAATGACGG 2 04 0 

CGATGTTTCG TAACAAATTC TAATTCATTC ATAATTATCA ATCCTTTGTG TTAAATTACT 2100 

20 ATATAAATAG TGTAACGtAT TTCGAAATTT GTGATCATAA GTTTATTCAA TGCTAAACAA 2160 

TAAGGTTGAG ACATAATCGT ATCTCAACCT TGAAATTATT ATACGTTGAC GTCACTAGTC 22 20 

ATTCAGTTTT CTTAATGCTG CTACAATCTC TTTTTTAGTA TCTTGTACTT CAGAAGCTTG 22 80 

CTTAATCACT TTTGCAGGTG TACCAGCAAC AACTGCACCA GCTGGTACAT CTTGTGTCAC 234 0 

AATCGCGCCA GCTGcAACAA TAGCACCTTT ACCAACACGT ACACCTTCTA AAATAACTGC 24 00 

ATTTGCAC CG ATTAATACAT CATCCTCGAT TATAACCGGT GAAGCACTAG GGGG TTCAAT 24 6 0 

CACACCTGCT AATACTGCGC CAGCCCcTAC ATGTACATTT TTACCAGTTG TAGCACGACC 2 520 

ACCGAGAGTA GCATTCATAT CAATCATTGT ACCTTCGCCA ACGACTGCGC CAATATTAAT 2 580 

35 TGTTGCGCCC ATCATAACGA CAGCACCATC TTCAATAATG GCTTGTTCTC TAATAAACGC 264 0 

ACCTGGTTCA ATTCGTGCAT TCGTATTTGT TAAGTCTTTT AATGGAATAG CAGAATTGCG 2700 

ACGATCCATT TCAATTTCTA TATCTTCGAA TTGACTACCA TATGCTTCGT AAAAAGGTTT 2 760 

40 CCAATCATCC GCTTCACAAA AGATTACTTT AGATTGTTCT GAACCAAATA CTTTAAAACT 2 820 

TTCTGGATAT GTGATGCCTT CAAAATTACC ATTTAAATAT ACTTTTATTG GTGTAGACTT 28 80 

TTTAGCATCA CTTATATATT GAATAATTTC TTCAGCTGTT AAATGTTGTA CCATAAAATA 294 0 

ATCGATCTCC TTTAATATGT TTATAAGTTG TCAAACGTAT AAAAGCCGTT TGGTTTATTA 3000 

ACTAAGCGTT CTGCTGCTTG TATTGCACCA TTCGCAAAAA TATCTTTTGA TTGTGCACGA 3060 

TGCGTGATTT GAATCGTTTC ATCAGTGCCA GCAAATAGAA CTTCATGTTC ACCGACAATC 312 0 

GTACCTCCAC GAATAGAATG TATACCAATA TCTTGTGGCT GGCGTTTTTC ATTTAATTCA 3180 

TGTCTATCAT ACACAGGTGT TACATTTTCT TTCAAAGATA CGATCACATC ATACAATTTT 324 0 

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TCGAAATCAT CAAGT AGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACGCCA 3 360 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 34 2 0 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTTCATCT 34 80 

AATAAAGGGA AAAG CAGATT TGGATTTGAA AAATCTATTG CAACATCGGC ACCTTTAACA 3 54 0 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 36 00 

CCAACGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGTTGATT CATTGCGCCA 3 660 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 3 72 0 

GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 3780

CAAATCCTAA ATAACTTGTT AGAGCTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 3 84 0 

CTGATAACAG TGTGCCGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTGAG 3 900 

20 CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAACAGAG ATAACCCCTT 3 96 0 

GACCGCCACG TTGATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATGCAAATG 4 020 

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 4 08 0 

TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AGTTTCTGGT TCAATTGTCA 414 0 

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTGCAATCG 4 2 00 

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4 26 0 

TAATTGCATC AGCCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4 3 20 

CATTAGTGCC AGTTCCTGCT ATGACAGGAA CACGTTTATC TACAAGATCA ATAACTGTTT 4 3 80 

35 TTAGAATGCG TTCTTTTTCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 444 0 

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGTGTTTTC AAAGCTTCAA 4 500 

TATTAACTTT GTTATTTGTA AAAGGGGTTG TAAGTGCaAC SCCAACACCC TCAAATAAAT 4 560 

40 GTGTCATTTT AATTCGCTCC TTTTAAACGC ATAACTTGTT CCAATACTTG TACAGCATTT 4 62 0 

AATGCAGCAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTTCTAAT 4680 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATTGCC 4 740 

ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4 800 

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4 860 

TGACTATCTT GAACAGGCAC ACGTGCGCAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4 92 0 

AAAATTTTTC TCGTCTCATC AATCATTTTT TGTTCTTCTT TTGTATATCC GTTTTCTAAA 4980

AACACATCAA TATGCGGTAA CACATTATTA TAAATTGGAT GTGGATATGC TTCTGGTGCT 504 0 

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TGATATGTTG TATATGCCAC TCGTTTTAAA CCATAAGCAT CTTG CAATAC TTTTAGAGGT 5160 

ACAACAGATT GAATCGTAGA GCAGTTTGGA TTGGCAATGA TACCTCTTGT AAATGTAGGT 52 20 

TCATTGACTT CCGGAACGAT TAAATCAATA TCTTCTGCCA TACGCCATTG ACTTGAATTG 52 8 0 

TCTATAACGA TTGCACCAGC TTTTTCAAAA AGTGGGGCAA AGTGTTCGCT TGTACCGCCA 534 0 

CCAGCACTCA TTAATACATA ATCGAAATGT TCACTTGCAC GAGCATCAGT TAATTCTTGA 54 0 0 

ACTGTATATG TTTTTCCTTG AAATTCAACT TCTTGCCCTG CAGAACGTGC TGATGAAAAT 54 6 0 

AATACTAATT CATCGAAAGG AATATTTTTA CGATTTAATG TCTCCAACAT TTTTGTACCT 5 520 

ACTAATCCTG TTGCACCCAC AACTGCTAAC TTTGTCATAA CTTGTCACTC CATTTTATAA 5580

TAATTTCCaA TTTTTAGAAT ATTTTAACAA TCATTTTACC ATTAAATGTT AAATGCGTCA 564 0 

TATAGTTTTT CTACCGCTTG TTGCCCATTA AAATCATCAA TGACGTATGA AATACTTATT 5700 

20 TCAGATGTTG TTGTTTGGTA GAAAGGTATA TTATTTTCAA TTAATGTCAA AAATGCTTTT 5760 

GATGCCACAC CTGACATATC ACGCATGCCT GAGCCAATTA ATGAAATTTT GACATAATGC 582 0 

TCATTGATTT TATAAGCTAA TGCTT CAT AT TGATTCTTTA ATGTTTCAAG AATCATAGAA 58 80 

ATTTGATGAA AATCACTATC TTTAATCGTG AAGGATAGTT GTAGCCCATC CAAGTTGACG 594 0 

ATTTGTGAAA TCATATCAAC ATTTACAGCA CCTTCTTCAA GTTCCGTAAA TAGTTGGGTA 60 00 

AGTAGCTGAT TGTCAGGTAG GGGATAACTA ATTGTTACAT GCATCATATG TTTATCCAAA 606 0 

GCCACACCAG TAACTGCTTT TTTCTCTAAT ATTTCTTCAT TTGACATAAT CCATGTTCCT 612 0 

TTCACGTTCG ATAAAGTTTT TCCTAAATAT AAAGGGATAT TATAGTTTTT AGCTAATTCA 618 0 

35 ACACTTCTTG TTTCAAGTAC ACCAGCACCT AAAGCGCTCA TTTCCATCAT TTCTT CATAT 624 0 

GAGACGATGT CTAGTCGTTT AGCCTTTGGT AAAAGTCTTG GGTCAGTGGC ATACACACCA 63 0 0 

TCAACGTCGG TATAAATTTC ACAAGGTATT TGATTACTAA CAGCAAGTGC CACAGCGGTC 63 6 0 

40 GTATCAGAAC CACCTCTGCC TAAAGTTGTT AATTCCTGAT GTTCATTGAT GCCTTGAAAT 64 2 0 

CCAGCAACTA CTAAAATATC GTTTTCTTGA AAGGCTTGTT CAAATGTTTG AGGATTAATT 64 8 0 

TGAGCAATTT TACTTTTTAA ATGATGGCCA ATGGTTTTAA TACCCGCTTG ATAGCCAGTC 654 0 

ATTGCTTTGG CATTCATACC GATATCATTT AATACCATTG ATAAATAAGA TACAGTTTGT 66 00 

TGCTCTCCGG TTGTCAATAA TAATGCCAGT TCTTGTTGTT TTGGTGCTTT AGTCAAGGTT 66 6 0 

GATACATTCG TCATTAATTG ATCTGTTGTG TTACCCATAG CACTTACAAC GACAATTAAa 672 0 

TTGTTCATCT TGATTGACTC GCTCCTTTAA CATTTCAGCG ATCCTTTTTA TTTTTGTAAA 67 8 0 

ATCACTGACG GATGATCCGC CAAATTTCAA CACACTTCTT GTTACCATAT AATCCTCCTA 6 84 0 

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TAATCTATAT ACAAGTGATG CACTCCATTA TTTTTAAATA ATGACAAACT CTCAGCTCTT 6 96 0 

AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 70 20 

AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 7075 
(2) INFORMATION FOR SEQ ID NO: 264: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5171 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

AGACGTACTT TGTGATTmCG AAGyrCGTAC TGmAGCACTT GTCGACGTTG ATGTACTTGT 60 

20 TGAACCTGAT TGACTAGTAC TTTGTGATAA TGACTTACTA TCAGAATCAG ATGTACTTTG 12 0 

TGAATCACTT AATGATTCTG ATGTACTACC TGACTGAGAC GTGCTCATTG AACTACTTAC 180 

GGACATTGAT TTACTGTCTG ATGCAGATAA TGACCCACTT GTACTGATAG AGTCACTTAC 24 0 

TATCTCTGAA GTACTCATCG AGTCTGATGT ACTTGTTGAG ACACTTTGTG ATGCTGCTAT 300 

GCTTAGTGAT CCAGAAACAG AACCACTTGT GCTCGTCGAA TCGCTCAATG ATTCTGATGT 360 

ACTCATCGAT TTTGAATCAC TTGTACTTAA TGATATTGAT GTACTTTGTG AATCTGATTT 42 0 

GCTTGTTGAC GCACTTTGAG AGTTGGCTAT GCTATTTGAA ATACTGATAG AGTCCGAGGT 480

GCTAGCTGAC TCGCTCAATG ATGTTGATGT ACT AATTG CA TTCGATGTAC TGTCACTTAA 54 0 

35 TGATGCTGAT GTACTAGACG ACCCTGATAT ACTCGTTGAT AAGCTTTGTG ACTTAGACAA 6 00 

GCTTCCTGAT GTACTCATAC TTAATGAGTC ACTGAGTGAT GTTGATGTAC GCAATGAATC 6 60 

AGATSTACTT GTTGATAGAC TTTCGGATTT TTCAGTACTG CTAGAGTTTG AAATAGAATC 720 

40 GCTTAATGAT GTTGATTTAC TAGCTGAATC CGACATGCTT GATGATACAC TTTGTGAATT 78 0 

CACTAAACTT GTGCTTGTTG AGCTTGATAC ACTATTACTT TCAGATGTGC TTAATGACTT 84 0 

AGATGCACTC ACAGAATCAG ATAGGCTTAC ACTTGTCGAT TTCGAGGTAC TAGCTGATGT 900 

AGATACCACA ATCGATCCTG ATGTACTCGT TGATGCACTT TGTGAGTCAG CTTTACTTGT 960 

TGACACACTT TGAGATTGTT GTGTACTTCC TGATGTTGAT ACGGAATCAC TCATG CTATT 102 0 

TCTTGTTACT TCATATTTAA AAGTTGTCGT CGTTTTGTTA CCGCTCGCAT CTGTAGAAAC 1080 

GATTGATATA GTACTTGTAC CAATGTTTGT TGGTGTACCA CTAATAGTAT TATTTGTACT 114 0 

ATCAAATGTT AGTCCGGATG GCAATCCAGT CACTGTATTC GTCACCGCAT TTCCACTGTT 1200 

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ATTGGTGTCA CTGTTGGTGC TGTCGTATCC ACAACATTTA TTGTAAAAGT TGTCGTCGAT 13 20 

TTGTTATTTG CTTGGTCAGT AGACACAACT GTCACTGTTG ATTGACCAAT TTTTGTTGGT 13 80 

GTCCCAATGA TTGAATTCGT TGCACTATCG TAACTTAATC CGCTTGGTAA TCCTGTAACT 144 0 

GTATTTGTCA CAGTCCCAGT ACCATTATCC GTTGTAGTCA ATACAATAGG ATTCATTGTT 1500 

TTACCCACTT CTATGGTTTG ATTGCCTACA GTTACAGTTG GTGCTTTTAC ATCAGTAAAA 156 0 

TAATATGTCA CTGATTGTCC AGCATTCGTC ATTTTTACAG TTTTATTTGT ATCATTATAA 1620 

GTTGACGCAT ATGAACTATC GACGGACGTG TAGTTATATC CTTTAGCAGT CAATGCAGAT 1680 

TGCTGaTTAT CGATTGTCAC GACTTGATCA ACATTTCCTG AATATGTTTT TGGTGGAATA 1740

ATATCTTTAC CTGTTGTTAC ATCAACGTAT CTCACTTGTG TAnCAGCAGA CTCTGTATAT 1800 

TCGAATGTTC CAAATTGTAC TTGTTGTAAA TTTGTCGCGC CACCTGTTGA GGCTGTCATT 1860 

20 GATAATGAAA AGTTGGTCGT ACCACTTTTC GCAATCCAAT CTGAAATATT ACGTGTCCAT 1920 

GTTTGACCTG CATATTTGAC AGTCATAACC TTTGTATCAC CATTATAGTT AATATCAAAA 1980 

TCTTGGaACG TGTTATTTGT AGGTTGaACA TTTAACTTCG CAGCATTATC AG CTGTTG AA 2 04 0 

CTTGATGTAT ACGTTGTCGC AACACCATAA CT AT CTGTTG TTACAAATGC ACCAAACGCA 2100 

CCTCCACCAG CTACATTAGA TGGGTCAGCA TTCGCCTTTG CAGCTGAATT TGGTTTAGAT 216 0 

GTATTGTGAT ACGTATCCAA TTTGAAGCCA AATGCGTTAC TTAAGCCACC AATACCTACT 222 0 

GCGGCACCGT TTAACCCTGT TTCACCTAAT ACACCTGGTG AAAAGGCAAA ACCGATACCA 2280

TCTCCACCAT TTCCATGCCC TTCATATTTG TTACCTAAAT TTACTTTTCC AGAAAAATGA 2 34 0 

35 AAACTCTTAT TAGAGTCAAT ACGTGTTCCT AATGTAATAG CACCTTTTTG GCTGTATGCA 24 0 0 

TCCTGTGTTA ACGTCACAAT ACCGGTACTT TGATCATAGG TAGCATTACC TGACGTTGTC 24 6 0 

ATATATTGTT TTAAGTTATC TTTATTAACT GTAATTGTAT TAGCAGTTAC TGCGGTTGTC 252 0 

40 GTCgCTGctG ACGCAAATGT TGA CAT AG CT AAGCGACTGA AAGTTCGAAG TTTTACTGGT 2 58 0 

GCGGTGCTAG TTGACGTTGT GCTAGTTTTG TTTAAGTTGA CCGAAGATGG CGTTGTGCTT 264 0 

TGTGAAGTGT TATTTGATGC AGTACTTTGA TTTGTTGATG TATTAATTGG TTGTTCTGTA 2700 

CTTGAAGTTG AAGCTACAGA TTTAGTATCA GAACTTGATG TAGTATTCTT TGAGGATGTT 276 0 

GATTCTGATG TAGATGTCAA TTTCTCTTGT TGATTGCTTG TACTATTAGT TGTCGAAGTG 2 82 0 

ACCTTTTCAG ACTTTTCACT TGAGACTGTG TCACTATTTG ATGTTTGTAC CGAACTACTA 2 88 0 

TTTtTCGTTA CACTTGTGGA ATCGGCTGTT GATGTTGATG CTTCGATTGT CGTTGAGTTT 2 94 0 

TGATTACCTA CTGTTTCACT TTGTGTGTTT AATTCAGAAG TTAATGGTGC ATCAGAAGCC 3000 
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GTTTTCAGTC CGTATCCCGT CATTTTTTTA CTAATGCTTT GATTATCTTG ACTCACTAAA 3120 

CTATGACTAA TAAATGGTAG CCCCATAATT TTGAACATTT CTATTTCTTT AATTCCGGAT 318 0 

TTTACCCAAT TTTTTCCAGA TTTATAAAGT CTTACTCTTG TTTTTTCGTT TGCTAAGCTG 324 0 

TCATGAAATG CTTTCTGTCT TTTACTCATG TAATAACTCC TTGTATTATC TTTACATTCA 330 0 

TTAGATTATA ATATATGCCA CTATTCAATT TAATACAACT CTTTTTTGAT ACAAAAATAC 336 0 

TCATTTTGTT AAAATTTGTA AAAATTCaTT TTTATTCGTC TAAATGTAAT CGTTTTCATA 34 2 0 

TTTTTAAAAT TACTTTTTCT CGTTTATGCG TATAATCTTT TTTTATATAA ATTTGGCTAA 34 8 0 

15 TTGGCTTTAT GTTTAAT CAT TATAATTGTT TCGTTTTTAA AATAATTATT GTATTAATAT 354 0 

ATCTATACCA TCCACCTTTT ATTTATAAAT AGTTAATTTA CAACTAAACG ATAAATATTA 3 600 

TATGCAAAAT ACATCTTTAA TATTAAAGTA ATACCAATAT TTTTTCAATA AACCTAGTGT 3 660 

20 AATATATGTG TAATTCTAAA AGATTCTTCT TTAAAAATAT AAATACCACG ACATATTGCT 3 72 0 

TTAACATTTT CATTTATAAA GCGAAAAAAT GCATCGCTAC TAAGTTGAAT GTTTAGTAAG 378 0 

GATGCATTGA ATT C ACT AAA ATGATTAAAT TACTTATATC TTTTCATCTG ATTGATTATC 3 84 0 

GAAATTTCTT CCTTCTAAAC CTGCTAACTC TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3 90 0 

AAATATCTCA TTCACTACTG TGTAATCGTA ATATCCTAAT CTGGCAATAG GTTTAATCGA 3 960 

CTTAATGTCC AATTTACCAT TATCAAGAAT AACCTTATCG TCAATATGAA CTTGGGCAAC 4 02 0 

TCTTCCTATA ACAATATCTA CGGTAGATAC TGGATCTCCA GTTGGAATAC GAATCGTTTG 4 08 0 

AACGTACTCA CATTCAAAAT GAACTGGCGA TTCTTTTACA CGATATCCTG GAGCTTCTAT 414 0 

35 ACATTTTTCC TTTGTTACAC CTGCAAAATT AAATTCATCC TCTTCTGGTG GCAATGCTTT 42 0 0 

CGATGATAAA TTAACTGCTT CTCTTAAATC ATACGTTGCC AT ATT CCA CA CAAACCAACC 426 0 

TGTCTCTTCA GCATTTTTCA CTGTATCTTT ACGTTCGTGA TCACCAAGAA CGGATTGATT 4 32 0 

40 TGCTGCGAAC ATAACCATAG GCGGATCCCA AGTTAAGTTT TGATACTGAC TATAAGGCGC 4 3 80 

TAAATTATCT TTCCCATCTT TCGATACAGT AGAGATCCAC CCTATTGGAC GTGGTACTGT 4 44 0 

ACTACTTTTA AATGGGTCGT GCGGTAAACC ATGACTTCTT ACACCTTGTT TTGGCGAATA 4 500 

ATTCATACTA TCTTCACC CC TTATAAGTAA TTACATTTAA GGTTACGCCC TCTTTACATA 4 56 0 

AGCGTCTAAT ATAAATAAAC AATTTATTTA TAAGTAGAAA CTATATATGA CGTGGTTGCT 4 62 0 

TATAATTTGC GTTCTTGATT CGAAAAATTC AGATAAGGAT TTATACAATT AATATTTATG 4 6 80 

ATATCTTTTG TAAATTTAAT TAATTATAGT TACTTCAATC ATGATTAGTT TATAATAATA 4 74 0 

AAGTGAAATT GAAAAAGACA GCTATTATGC GATGAGCGAA AAACTTCAAG TAAAACAAGA 4 800 
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TAAGTTCAAA AAAGAATTCA AACCTGTTAT GCACTTAAAA GGTGATGCAT TCAATCAACA 4 92 0 

GTTACAATCT TTGATTAACA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 4 98 0 

TGCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATGCTT GGAACCTTCA 504 0 

AAAATCTATA AATGACATTA TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTTATAA 5100 

AAGATGATGT TTCCCCGTCA ATGGTAGATG GAAATGGCCG TTTAAAATCG GGATACTAAT 516 0 

GTATTTCCAT C 5171 
(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3589 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:



CTACACACTA AACCTATTTC AGTTATGGGT GGTACAGTGA TTCTCTTTTC ATTTTTAATA 60

GGTATTTGGA TTGGTCATCC TATTGAAACA GAAATCAAAC CACTTATTAT TGGTGCGATT 120

ATTATGTACG TACTTGGGCT TGTAGATGAT ATCTACGATT TGAAACCGTA TATAAAATTG 180

GCTGGTCAAA TTGCCGCTGC CTTAGTAGTT GCTTTTTATG GTGTGACTAT TGATTTTATT 240

TCGTTGCCAA TGGGTACAAC GATTCATTTT GGATTTCTTA GTATTCCAAT TACTGTGATT 300

TGGATTGTTG CTATTACAAA TGCAATTAAC TTAATTGATG GACTCGATGG TTTGGCGTCG 360

GGTGTTTCkG CAATCGGACT CATTACAATA GGGTTCATTG CAATTTTACA AGCTAATATT 420

TTCATAACGA TGATTTGTTG TGTTTTATTA GGCTCTTTAA TTGGG'rrriT ATTTTACAAT 480

TTCCATCCTG CCAAAATATT TTTAGGTGAT AGTGGGGCTT TAATGATTGG ATTTATCATC 540

GGATTCCTTT CTTTACTCGG ATTCAAAAAT ATTACAATTA TTGCATTGTT CTTCCCAATT 600

GTTATCTTAG CAGTTCCATT CATTGATACT TTGTTCGCAA TGATTCGACG TGTGAAAAAA 660

GGGCAGCATA TAATGCAAGC TGATAAATCG CATTTGCATC ATAAACTATT AGCTTTAGGC 720

TACACACATA GACAAACAGT ATTATTAATC TATTCAATCT CTATTTTATT TAGTCTTTCG 780

AGCATTATTT TGTATGTATC GCCACCATTA GGTGTTGTAT TAATGTTTGT ATTAATCATA 840

TTTAGTATTG AATTAATTGT TGAATTTACA GGATTAATAG ATAACAACTA CCGACCAATA 900

TTAAATTTAA TTAGTCGTAA GTCATCTCAT AAAGAGGAAT AGGGAATGAA AGCATAGCTG 960

TATGGGATAA TTTGTATTAT ATGGCTTTAC TCTTTACAAT TTTTTTGTAT TAAATTTCAA 1020

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ATTTACCGTC TTATGATAGT GCTTTTTATT TTTATTCAGT TGGTATATCG AAAGGTAACT 114 0 

GCTTTGGAGT TTCTTCAGTC AAATCGAAAT TTCCTGCAGT CATTTGATTT AAAAAGTTAA 1200 

TAAACGCTTC ATAGTCACTT TTAACGACAT CGATATAGTA GCTTACCTTA TCAGTGTAAG 126 0 

TTTGGTTTCT TAACATAAAA TGAGTTGAAG CTAATTCATA TTCAAATTTA CCAGTTTGAT 132 0 

CATAATTCAG TGTTACTATA CATGGTACTG CTTCTCGTAG TTCGACACGC CCGATATCAT 13 80 

AAATGACGTC TCTAACAGCA CCGCTATAGG CGCGAATTAA ACCGCCACCA CCTAATTTAA 144 0 

TACCACCAAA ATATCTTGTT ACTACGACAC ACGCATTATG aACATCGrGC TTTTTTaATA 1500 

TGTCTAACAT TGGGaCACCG GcAGTTCCtG TCGGTTCACC ATCATCATTC GCnTTTTGAA 1560 

TATTCATTtC AGGTCCAATA GTATATGCAG AACAATTATG AGTGGCATCT TTATGTTCTT 1620 

TTTTTATTGC AGCAATAAAT GCTTTaGCTT CATCTTCATT TTGAACAGGT TTGATATGAG 16 80 

CAATGAATCT TGATTTACTA ATCACATTTT CAATAATGTG TTCTTTTTTA ACAGTAATGA 174 0 

TATTTTGTGT CATAATAACT CCTTAATTCA TAAGCTTAAG ATTATTTAAT CTTCATTATA 1800 

CACTGAAAAT GACATGACTA TAAATCGTTT GATTGCCATT TTCTTTTTAA CTGAAATATT 1860 

GTATCATTGC TATGAGTATA TTTTAGGAGG ACG ACT ATG A AAATTGCTGT GATGACCGAT 1920 

TCTACAAGTT ATCTGTCGCA GGACTTAATC GATAAATATA AT ATT c AAAT AGCGCCATTA 1980 

AGTGTGACTT TTGAAGATGG CAAGATTATA CCAGAAGAAA AAGTTCGTAC TAAAAAGCGT 2 04 0 

G C CATTC AAA CATTAGAAAA GAAAGTATTA GATATTGTAA AAGACTTTGA AGAAGTAACT 2100 

TTATTTGTCA TAAATGGAGA TCATTTCGAA GATGGTCAAG CGTTATACAA AAAGTTACAA 2160 

GATGATTGTC CTTCAGCTTA TCAAGTAGCA TACTCTGAGT TTGGTCCAGT TGTTGCAGCA 2 220 

CATTTAGGTT CTGGTGGATT AGGTTTAGGC TATGTTGGCA GAAAAATAAG ATTAACATAA 22 8 0 

TTATAAAATT TTAATAAAAG AGTCTATATT GTAATTGGAA ATTATCTCTC GTATACATGG 234 0 

CTTTAAATGT TCATCATTTG AAAGCCAAAA TGCTAAAGAT ATAAGAAAAT CATTATAATA 24 00 

TTAGGCTCTT TTTTACGTTG AAATGAGGTT TTAAGCATTA AACATTACGG GAAATTAATT 24 60 

CATCCTCATA CTTCACTTAC TAATGAAAAA ATTAAAAAAG AAGTAACAGG TGTCATCAAA 2520 

45 CAAAATTCAA ACTATTATTG TGTTCAATGT GAAAGTACAA ATCCAAAGCA TTTTTATCAG 2580 

TATGATTCCT CAGTACATTC CAAGAAAATT GTATATTGCA GAAATTGTAT ATCACTGGGT 264 0 

CGAATGGATA ATGTAACAAG ATATAAAATA ACAGAGAGTT CGCAAAGTTC ATCACAAGCA 27 0 0 

TATTATCATC TCTCATTTGA ATTGTCGGAA CAGCAGTCTT ATGCCTCAGA ACATATTGTT 276 0 

CGAGCCATTA GAAAGAGACA AACGATTTTG TTATATGCCG TAACAGGTGC AGGTAAGACA 2 82 0 
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TCACCACGTG TAGATGTTGT TGTAGAAATT AGTAAACGTA TTAAAGACGC ATTTCTTAAT 2 94 0 

GAAGATATAG ACATACTACA CCAGCAATCA AGACAACAAT TTGAAGGGCA TTTTGTTGTA 3 000 

5 TGCACAGTGC ATCAACTTTA CCGATTCAAA CAGCACTTTG ATACTATTTT TATTGATGAA 3 060 

GTCGATGCCT TTCCTTTATC AATGGATAAA AATTTACAAC AAG CATTGAA GTCATCTTCT 312 0 

AAAGTTGAAC ATGCAACAAT TTATATGACA GCAACACCAC CGAAACAACT TCTGTCAGAG 318 0 

ATTCCCCACG AAAATATAAT TAAATTGCCA GCTCGCTTTC ATAAAAAATC ACTTCCAGTT 3 24 0 

CCTAAATATC GTTATTTCAA ACTTAATAAT AAGAAGATTC AGAAAATGTT ATACCGAATT 330 0 

TTACAAGATC AAATTAATAA TCAACGTTAT ACACTGGTGT TTTTTAACAA TATAGAAACA 336 0 

ATGATTAAAA CATTTTCGGT TTATAAGCAG AAAATTACTA AATTAACATA CGTCCATAGC 342 0 

GAGGATGTTT TTCGCTTTGA AAAAGTTGAA CAATTAAGGA ATGGACATTT CGATGTCATT 34 80 

20 TTTACTACGA CAATATTAGA ACGTGGATTT ACAATGGCAA ATTTGG AT G T TGTTGTTATC 3 54 0 

GATGCACATC AATATACTCA AGAGGCTTTA ATACAAATTG CTGGACGTG 358 9 
(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1017 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:

TTTCCAAGAA GGcTTgAAAA AtGTTwCaAG TGGTGCGAmC CCAGTTGGTT TACGACAAGG 60 

TATCGACAAA GCAGTTAAAG TTGCTGTTGA AGCGTTACAT GAAAATTCTC AAAAAGTTGA 12 0 

AAATAAAAAT GAAATTGCGC AAG T AGGTGC GATTTCAGCA GCAGATGAAG AAATTGGACG 180 

40 TTATATTTCT GAAGCTATGG AAAAAGTAGG TAACGATGGT GTCATTACAA TTGAAGAATC 24 0 

AAATGGACTA AACACTGAAC TAGAAGTGGT TGAAGGTATG CAATTTGATC GTGGTTATCA 3 00 

ATCACCGTAT ATGGTTACTG ATTCAGATAA AATGGTTGCT GAATTAGAAC GCCCATACAT 3 60 

45 TTTAGTAACA GATAAGAAAA TCTCGTCTTT CCAAGATATC TTACCTTTAT TAGAACAAGT 420 

GGTTCAATCT AATCGTCCAA TCTTAATTGT AGCTGATGAA GTTGAAGGCG ATGCATTAAC 4 80 

AAATATCGTG CTAAACCGTA TGCGTGGCAC ATTTACAGCT GTTGCAGTAA AAGCACCTGG 54 0 

50 TTTTGGTGAT CGTAGAAAAG CGATGCTTGA AGATTTAGCT ATTTTAACTG GTGCGCAAGT 600 

GATTACTGAT GATTTAGGCT TAGATTTAAA AGATGCATCA ATTGATATGT TAGGTACTGC 660 
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CAGCATTGAT GCACGTGTTA GCCAATTGAA ATCTCAAATT GAAGAAACTG AATCTGACTT 780 

TGATCGTGAA AAATTACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 840

AgTAgGTGCA GCAAGTGAAA GAG AG CTTAA AGAACGTAAA TTACGTATTG AAGATG CATT 900 

AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 960 

AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT GACATTGAAA CAGGTGT 1017 



(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

TTAATCCAGC GTTAACTGTA TTTGCATTTA TTATGATTAT TTCGATTCTT TTAGCGTATG 60

TATTTAAATG GCTTGGATTA GTGGATGATG TGTTATTAAT GGTCATTATC ATTTCAACTA 12 0 

TTTCCTTAGG CGTAGTTGTT CCAACTTTAA AAGAAATGAA TATTATGAGA ACAACTATAG 180

GGCAATTTAT CCTATTAGTA GCAGTACTTG CGGACTTAGT AACTATGATT TTATTAACGG 24 0 

TCTATGGCGC AATCAATGGT CAAGGCGGCA GTACAATATG GTTAATAGGT ATATTAGTTG 3 00 

TTTTCACAGC AATTTCATAT ATTTTAGGTG TTCAATTTAA AAGAATGTCA TTTTTACAAA 36 0 

AATTGATGGA TGGTACGACG CAAATCGGTA TTCGTGCGGT ATTTGCATTA ATAATATTAT 420

TAGTAGCCCT AGCAGAGGGA GTTGGCGCAG AAAATATATT AGGTGCATTC TTAGCAGGTG 4 80 

TCGTTGTTTC ATTATTAAAT CCAGATGAAG AAATGGTTGA AAAGTTAGAC TCATTTGGTT 54 0 

ATGGGTTCTT TATTCCTATT TTCTTTATAA TGGnTGGTGT AGATTTAAAC ATACCTTCAT 6 00 

TAATTAAAGA ACCGAAATTA CTAATTATCA TACCGATTTT AATCGTnGCA TTTATCATTT 660

CAAAATTAAT TCCAGTCATG TTTATTCGAC GTTGGTTTGA TATGAAAACA ACGATTGCAT 720

CAGCATTTTT ATTAACATCA ACATTATCGC TCGTGATAGC TGCAGCCAAA ATTTCAGAAA 780 

GATTAAATGC TATTTCAGCT GAAACGTCAG GTATATTAAT TTTAAGCGCA GTCATTACAT 840

GTGTATTCGT TCCGATTATT TTCAAAAAAC TGTTTCCAGT TCCAGATGAG TTTAACCGTA 900 

AAATTGAAGT TAGTTTAATT GGTAAAAATC AATTAACGAT TCCTATAGCG CAAAATTTAA 96 0 

CATCTCAGTT ATATGACGTG ACATTATATT ATCGCAAAGA CTTGAGTGAT CGTCGTCAAT 10 2 0 

TGTCAGATGA TATCACGATG ATAGAAATTG CTGATTATGA ACAAGATGTT TTAGAACGAC 10 8 0 
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AAGTTGCTAA ATTAGCCAAA GCACATCAAG TTGAGCGTGT CATTTGCAGA CTTGAAAGCA 



1200 



CAACGGACGA TACAGAGTTA GTTGATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260

ATAAAATCTT ATTAAAAGGT TTAATTGAAA CACCTAACAT GTTGAATTTA TTAAGTAATG 1320

TTGAAACGTC ACTATATGAA ATTCAAATGT TAAATTATAA ATATGAAAAT ATTCAATTAC 1380

GTAATTTCCC ATTCGGAGGA GACATCATC 1409




(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4702 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

AAAGAGGGTT TTTTACTACT ATAATCCATC CTTTAATGGA ATTCCATTTG TCCCTTTGGC 60 

CCGTTCATAA CCATAGGAAA ATATATGCnG nAATCATTGA TAGTAAATAC ATCAATAATA 12 0 

CGAATAATGT CGGTGAAATC ACGTCCGTAT TACCATTATT AGCTAAAACA TTTTCCAAGT 180 

TTTCTTTTGA ACCGGATACT CATAAATGCT TTTAATGCnT GGTTTTGTCT GTCGCCATCT 240

TTAGAATTTG TAATACTTTT TTAAATTCTT TAGCGAACAA CTCATTATCT TTAT CGTTTT 3 00 

TAGCCATTTG ACGATTCAAA TCGTTAGCTC TTACGCCTAA TGCTCGACCT GTCGCATGTA 36 0 

GCTTATTCAC ATTATTGTCG ACTTGATTTA ATTGGCCACT AACAGAATCT GCAATTGATT 42 0 

TTGATTCTTG TGTATCTGAT AGCAATTGCG T ACT CTTTT C AGAAATTCTA CTAATTTCTT 480 

TATCTAAATT TGAAGACATC GTATTAAATT CATCATTTTT GCCTTTATCA ATTTTTGGTT 54 0 

CTTGTGGCTC TTCAGCAAAA GTCTTTTTAA CGTTTTCTAA CTGATCAATC AGCTTGGAAA 600 

TATCTTCTTk ATTTTTTGTT gTATTCTTTT TGTTATTTAA AATGTCATCA ATCAGTTTGT 660

CTGAGTTTTC TTCCATTGAA TCAATTTGaT GTAACACAGC TACTTTATCG TCTTTGAAAC 72 0 

TTTCCATGTC ATTGATAACT TGGTCAACCA TCATATCAAT TAAACGTTTG TTGTCGAATG 78 0 

GTTTATCTTC ^CTGC CTTTT GTATCTGTGT ACATTTTATA ATGCGCATCA AACCTAGATA 84 0 

ATG CACTCAA TTGCTGGCTT AATGCATCTT TCGATAAGCG ACCATCaAGG TTATGATTCA 900 

ATGTTACATC CACAACACTC GTTGCTTTCT TATCATTTGG TTCATCTTGA CGATTTGCTT 96 0 

GTCCAAATAA CAATTGTAAA TGCATTGTTT TATCTTTTAA GAAATCTTTC TCAGCATCCT 1020 

TTTTCAATTT AGCAACGCCA TTGACTTCAA CTTTATATTC CTTGTTAGAT GTATCGAGTT 108 0 






EP0 786 519 A2 

TAAAATGAGG At CTGTTGCA ACAGTTAATT GATTAATATC ATTACTTTTA ATCGTTTCAG 120 0 

TACGTTGCAC TTTGACACCA TCATTAATCA AACTACTTGT GTCTTGCGCA ACTTGGTTAT 126 0 

CGTAATCTGT TAAATTAATG TGTTCTGCTA ACGGTTTTTT CAAATTATAT TCATTTTTAT 1320

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA TTTAAACCTG 1380 

CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 14 4 0 

TG A CAATAT C ATCATGAAGT TGTTTCTCTA AGTTTTCAGC GAATTGAGCA TTGTTCATTT 1500 

TAATGCTATT TAGCGCATCT TGTAAGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGATT 1560 

CTGTTAATTG CTTACGATAn TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGTGATT 162 0 

GAATGATTTC TCTTAATTTA TCTAAGTTTT CTTTAACAGT TTGTTTATAT TCTTCTTTAC 16 8 0 

CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTT AATGCATTGA 174 0 

TATAGTTATC AAGTTCCACG CTATCTTTTT GCGATTTATA ATCTTGTAAC ATTTTATCCA 1800

TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTG TTTTT C AATTAAAGTC GAAACATTAT 186 0 

AATCTGTGTT CACTCTGAAC GTATCTGAAT TCGCACTCAA TAATGATTTA TTGTATGTTT 192 0 

GGAACCATTT TGTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 1980

ATTCCGGGAA GTCGTTAATT GGATTTAATA AG T AATTCG A GAATTTACTA TTCACACCAT 2 04 0 

GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 2100 

TTGTTAAATA AATTTCGACT AAGTTTTTGT TAAAGTCGTT AAGTACATTA CTTACAACTT 2160 

TTTCTGTGTT TTTAGCTACT TCTTCTTTTT GTCCTACAGC TGTTTTATAC TGTAGCGATA 222 0 

TTTTCGATGG TGTTTTAGCG TCTAATTGCA TTGCCAATTT TGAAAAGTTT TCTGGGATAA 228 0 

CAATCATGAC TTGGTATCCA CCA TTTTT CA AACCAGACTC AGCAACGTTT CTTGTTACTG 234 0 

TTTCAAATTT ATAGTTTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 24 0 0 

CTTTTTTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TGTATTTTAT 24 6 0 

TATTTTTATT ACTTACACTT TGGGATCCTT TTTCTGATTG ATCTCCATAT TTTGTTTGAA 2 52 0 

CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA AGTGACAATT AATGCATAAA 258 0 

TCCAATTTTT CTTTTTCATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 2640

AAAGCCACAT TAGAAAATGT GAC TTTG CCA ATTTCAGAAT GCTTATTGCA AACCGAAATT 2700 

ATTAGAAAGT TGTTGGTCTT GTTCTTGAAC GGCATCAGCA GTGCtATTcA ATTGTTGTTT 2 760 

AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTT GGAATTGCTC 2 82 0 

TTCGAAACGG CTG AAAGCTT GACCTTCCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 2 880 
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TCTGATTTCC TCTGGACTCA TCTTAATCAT TGCCATAACT AGAAACCTCC TGAATATTTT 3 00 0 

AAGTTTATCa AAACTTTTTA GGGACACTAT TTTTTGAAAA AGTGCTCCTT ACTCAAATAA 3 06 0 

TATATAAATT ATT AG TAT AT GTATATAGTt TTTTAAGTAT TTTTAGCTTT TTTAAAATAA 312 0 

ATATATTGAA TATAACCATA TATTTTTAAT TAACCATTCA TTTTTGTAAT ATAAATGTGT 318 0 

ATACTAAAAT TAAATTAAAT ACATAAAGGA TTAAATGGTT ATTATGAAGA AAACAATTTT 324 0 

ACTGACGATG ACAACTCTTA CTTTATTTAG TATGTCGCCT AACTCGGCTC AAGCATATAC 3 3 00 

GAATGATAGC AAAACATTAG AAGAAGCAAA GAAAGCACAC CCAAACGCAC AGTTCAAAGT 3 3 60 

GAATAAAGAC ACCGGCGCGT ATACTTATAC ATATGACAAA AACAACACGC CAAACAACAA 3 420 

TCATCAAAAC CAG TCACGT A CAAACGACAA TCATCAACAC GCAAATCAAC GTGATCTTAA 34 8 0 

CAACAATCAG TACCATTCTT CATTAAGTGG TCAGTATACG CACATTAATG ACGCAATTGA 3 54 0 

TTCACACACA CCGCCTCAAA CGTCACCAAG CAATCCTTTG ACACCAGCAA TACCGAATGT 3 6 00 

CGAAGACAAT GACGATGAAT TAAATAACGC TTTTTCAAAA GATAACAAAG GGCTTATTAC 3 660 

AGGCATCGAT TTAGACGAAT TGTATGACGA ATTACAAATC GCCGAATTTA ATGACAAAGC 3 720 

AAAGACCGCT GACGGTAAAC CTTTAGCATT AGGTAACGGT AAAATCATTG ATCAGCCTCT 3 780 

TATCACAAGT AAGAACAACT TATATACTGC TGGACAATGT ACATGGTATG TCTTTGATAA 3 84 0 

ACGTGCCAAA GATGGACACA CGATTAGTAC ATTTTGGGGA GATGCTAAAA ACTGGGCAGG 3 90 0 

CCAAGCTTCA AGCAATGGCT TCAAAGTAGA TAGACACCCA ACACGAGGAT CAATTTTACA 3 960 

AACAGTAAAT GGTCCATTTG GTCATGTAGC CTACGTTGrA AAAGTTAATA TTGATGGAAG 4 02 0 

TATTCTAATT TCAGAAATGA ACTGGATTGG TGAATATATC GTTTCATCAA GAACCATCTC 4 080 

TGCTTCAGAA GTTTCATCAT ATAATTACAT CCATTAAATT AATCATGACA TCAATAAAAA 4140 

GCGA€CAGTT CGCAGTTTAC AATTCGTAAC ACTGCAAAAT TGGTCGCTTT ATTTTGTATG 4 200 

TTATTCGATT ATAAAATTAC AAAGAAATGT TCTCTACATT CCCCATTAAT CAAAATCGTT 4 260 

TACGAAAGTA TAATTGTAGC TATAATAATC CAAGTCGTAA CAACTAGTGG CACTATCGTC 4320 

TTGAATAAGA ATATACCGTA TTTTTTCTTG CGATATATAT CCAGTACTAG CCAAATTAAA 4 380 

ATGATTATAA CACCAACAAA AATAAATACA GGATTCATCG ATATAGCATC TGCCTGTAAC 4440

TCAGGTTGCA TTCTTAATTT AGTGATAATT AACATCACTA CTGAAATAAT GAAAAAGTAG 4 500 

ATACCTCTTA TCTTTGATGT CTGTAAATCT AATTCCTGCT CTTCAATGAC CTCTTTAGAT 4 56 0 

TCACCCAATT CTTTTGCAAT CAAATAATTT ACTACCTTAG GTTTCACCCA TAAACACTTA 4 620 

ATTGCAAAGT ACATAAAAAT ATATGATCCA GTATCCATAA ACATTAAAAA GTTGCTTAAA 4 6 80 
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(2} INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

ACAAAAAATT CACCCTCATT AATATTGAAA CTAATGTTAT CGACAGCAAC ATGTTTGCCA 6 0 

TAACGCTTAG TTACATTTTT AAACTTAATC ACTTTGCCAC CTCTTTTTTT CTCATAGCAT 120

AAAACCGAGA TTATATGTAT GTATTCCCTA TTTAACCACG TTTATTACAA TTTTCAAATT 180 

TAAATGATTT ATCCTTGAAC TTTTTTAATA AAATAATGAA TAAwAGGwAA TCwCCAGTTA 24 0 

AGAAATAGTG TTATTTTACC TTGAATTCAA AAAAACACCC AGTAAAACAA GGAATGCTTA 300

CTAGGTGTCT TCACTATACT TTGGCTTTAT AATTTTGAAT CGTTt CTAAA AATGCTGGAC 3 60 

AATAATGTTT TAATTTGTAA CTACCTACGC CATCmATATT AATCATATCT TGTTTCGAAG 42 0 

CAGGCTTACG TTTAGCAAAT TCCTCCAACG TGTAATCAGA AAATATACTT ACAGGTGCTA 480

TCGTTAATTT GTCACTTAAC TTTTTACGAA CTTCTACCAA CTGACTGAAT AATACTCGGT 54 0 

CAACCCCTTC AACCGTATTT ATAAATACTT TTTCAGTCGC TTTTTGCTTA AATGGTGTTG 600 

TGAATACTTC TACTTCATTA CTGAGTAATT TTTTAATTGA AGTATCACAC ATTAATATTT 660 

CGTCATTTTC ATTTAAGAAC CCTTTGAATC TTAATTCATC TATTAAGTGA CTTAATTCTG 720 

ATGTTGTGTA ACCTTTCATT AAACCATGGG TTGAAATTTG GTCATAACCT TTATACTTAA 780

TATAATCTGk TGACTCTCCT CTTAACACTT GAATGATAAC ACTATAACTC TCTTGTTGTT 84 0 

TCATACGAGC GATGCAACTA ATAATCATCT TAGCTTCTTG TGTCATATTA TATGATTTAT 900 

CTTGTTGAAC ACAATTACTA CATTGTTCAC ATTCTTCTAA TTTTTCATTC GGTTCAAAAT 960

AATGGACAAT TGTTGCTTCT AGACATTTTT TTGTTTTTGT ATATTGAATC ATTTTAGTTA 1020 

ACTTTTCGCC CATTTTATCT TTATAGTCAT CATCAGCTTG AGAGACTGTT ATAAAATACT 1080 

CGTGTAAATT GATAtCGCGT TCGCTAAATA ACAAAATACA TTCACTTTTC AACCCGTCAC 1140

GACCTGCACG ACCCGCTTCT TGATAATAAG ATTCTAAATC TCCAGGCATA TTATAATGAA 120 0 

TAACAAAGCG TACATTGGAT TTATCAATAC CCATACCAAA AGCATTTGTA GCAACGACTA 126 0 

CTTTAACACG ATCAAATAAG AAATCATTCT GCGCTTCTTC TCTTTCTTTA TTGCTCAAAC 1320 

CTGCATGATA TATAACACTT TCAATTTTCT GACTTTCTAA GGCTTCTTGA AGCTCTTCAA 13 80 
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CCAATATAAA TTTTTGACGT TGATAAGTAG GATTTACTTT AAAAATTAAG TTTCTACGCT 150 0 

TAG TACT CGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCT 1560 

GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATCTTGA GGTAACGTAA 1620 

ATACTTTTGA AATAACATTT TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 16 8 0 

AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACG CTGAAGCATA TTTAAAAAAT 174 0 

ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 1800 

CTTTTTCAAT ACGTTGTTGC TCTTTTTGAG TCAAACTACT ATTTAAAAAA GCAGCTTGAA 1860 

TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TGATATTAGT GGACTTATTA 1920 

CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCmTATAGAC yTACCTCCAC 1980 

CAGTtGGkAA GrCACCAAGC A CAT 2004 
(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

AAAGATTGCT TGCCTTGAGG GTTTATATAT CTGACTGAAT TGCCACATTT TTATCAAGAG 60 

TAGTTGATAA TACTCATCAT AATTATAGCT AATATTATAT TTTTTTAAAA GATAGTGTAT 120 

GATTTT CTGG TGTTTGTTGT ATACGTCATT AAATTTCAAG TAGTCATTCT CCAAGTTATA 180 

CGTATAACAA ATATTTCCGG ATAAAGTTAG AATAAAATAT TTAGAAAAAT CATTCATTTG 24 0 

CGTAATCGCT AAATTAAGTG TTAAATATAA GACATAAGTA ATTAATTTAA TGTGATATGA 3 00 

TGTATTATTA CTTTGCTAAA TAGTAGATAG AACAAAATTT GTAATCGGGA GGTAACAATG 3 60 

GATTACGCAC ATTTAAATTT AGAACATTTT TTTGCACGAA ACGACGATTT AGATGTTATA 42 0 

AGAGATCGCG CTGATTTCGT GATGATAAAT AACTTCACTA ATGAAATGAT GTATCGTGAT 4 80 

GGTCAAATTG AAGGCACGAT TGATTTAAAT CAGTACTATT ATAAAAATAG ATCAAATGCA 54 0 

G CAAGTTTT A TTATGATGGA TTATAAAAAA GAAACTAAGT AAACGAACAA AAGAATTTTT 600 

TGTTTTTTAA TACGTGAATA ATAAGATTAT TGATATAAAG GTTTTCAAAG GTTATACAAA 6 60 

AAGATAAAAC ATTTATGATT CGTAGATCAA CGTAAAGTAA TGTTGATAAA TGGTTTAAAA 720 

CGTTTCATTT ACATTACTGT TTATTTATGA ATATGTAACA ATGCATAGAT AAAATTGTTA 780 




ACCTAAGAGG TGTGGATATG AATAAACACA AGAAAGGTTC TATTTTTGGA ATAATAGGAC 90 0 

TTGTTGTCAT ATTTGCTGTT GTCyCaTTTT TATTTTTCTC AATGATATCC GATCAGATAT 96 0 

TTTTCAAACA TGTTAAATCC GACA1TAAGA TTGAAAAGTT AAATGTTACA TTAAACGATG 1020 

CAGCAAAGAA ACAAATAAAT AATTATACGA GTCAACAGGT AT C AAA T AAA AAGAATGATG 10 8 0 

CATGGAGAGA TGCATCTGCA ACTGAAATTA AAAGTGCAAT GGATAGCGGT ACTTTTATCG 1140 

ATAATGAAAA GCAAAAATAT CAATTTTTAG ATTTATCAAA GTATCAAGGG ATTGATAAAA 1200 

ATAGAATTAA ACGTATGTTA GTAGATAGAC CAACGTTATT GAAACATACG GATGATTTCT 12 60 

TAAAAGCTGC TAAAGATAAG CACGTTAACG AAGTTTATTT AATTTCACAT GCATTATTAG 13 2 0 

AAACTGGCGC AGTTAAAAGT GAATTAGCTA ATGGAGTCGA AATTGATGGC AAAAAGTACT 13 8 0 

ACAATTTCTA TGGAGTAGGA GCCCTTGATA AAGACCCAAT TAAAACAGGT GCAGAATATG 14 4 0 

CTAAAAAGCA TGGTTGGGAT ACACCTGAAA AAGCTATTTC AGGCGGTGCT GATTTCATTC 15 00 

ATAAGCACTT CTTATCAAGC ACAGATCAAA ATACATTGTA TAGTATGAGA TGGAATCCAA 156 0 

AAAATCCAGG AGAACATCAA TATGCTACAG ATATTAAGTG GGCAGAAAGT AATGCAACAA 16 2 0 

TTATCGCTGA CTTTTATAAG AACATGAAGA CTGAAGGAAA ATACTTCAAA TACTTTGTGT 16 80 

ATAAAGATGA CAGTAAACAT TTGAATAAGT AATTTGATAA GCTACGAGTT GTTTTTATGA 174 0 

CTCGGACATA CTAAAAAGAC GCTTTCTATC TTGTTTTGAT AGAAAGCGTC TTTTTGCATT 18 00 

AGAGAAAACA CATTGATkGA TAAtCCCaCC aATGCAAgTG GGGcAGGACa TCGATAAAGA 18 6 0 

ATTACTTTTT CTTTAGAAAT TAGTATTTCT TATGCATGAG TTTTACT CAT GTATTCCTAT 192 0 

TTTTAAGTAC ACATTAGTTA TAGCTAATGA TAAAGAACCA CTACATAATA AATCATTAGT 1980 

GTTTTTTTAT CATTTCTGTC CCaCTCTCAT CGTGATTTGA AATTTTCAAT TGCGATTTTA 2 04 0 

ATTTCATCTC TTACACGTTG GAACTCTGAC CAAGGCTTGC CTGCAGGATC ATCAAATCCC 2100 

CAATGTTCTT TCXTAACATT TGTTGGTAAA GAAGGGCAAT TTACGTCTGC ATCACTACAT 2160 

AATGTAACAA CTAAATTTGA ATTTnTAATA ATATTATTAT CGGATTAAAA TCTGATGGAT 22 2 0 

GATTTGATAT ATCAATGCCT ACTT 2244 
(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ATAAGCAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 6 0 

AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAATATTATG 120 

TTGGTAAAGG TAAAATTGAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 180

TCATCACAAA TGATGAATTA ACGACTGCAC AATCCAAATC ACTAAATGAA GCTTTAGGTG 24 0 

TAAAAATTAT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 3 00 

AAGAAGGTAA ATTGCAAGTA GAGCTAGCAC AACTTGATTA TTTATTACCT AGATTGCAAG 3 60 

GCCATGGTAA AAGCCTTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 420 

CGAAGTTAGA GATGGATCGC AGACATATTC GAACTCGTAT GAATGAAATT AAACATCAAT 4 80 

TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCGAAATAA AAGAAATCAA AATCAGGTGT 54 0 

TTCAAGTAGC TTTAGTTGGT TATACAAATG CTGGTAAATC ATCATGGTTT AATGTTTTAG 6 00 

CAAATGAAGA GACGTATGAA AAAGATCAAT TATTTGCAAC GTTAGATCCT AAAACACGAC 660

AAATTCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT TTTATACAGA 72 0 

AACTACCTAC GACGTTAATT GCAGCTTTTA AATCAACTTT AGAAGAGGCT AAAGGTGCAG 7 80 

ATTTATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CCGTACGCAG TATGACACAG 840

TTAATGATTT AATCAAACAA TTAGATATGA GTCATATTTC TCAAATAGTT ATTTTTAATA 9 00 

AAAAGGACTT ATGTGATCAT GCATCAAATC GTCCAGCAAG TGATTTGCCT AATGTTTTTG 960 

TTTCTTCTAA AAATGATGGT GATAAATTAC TTGTTAAGAC GTTATTTATT GATGAAATCA 1020 

AAAGGCAATT AACTTATTAT GATGAGACAA TTGCGACGAA TAATGCAGAT CGATTATATT 1080 

TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 1140 

GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA 1200 

TAGTAGCTGA CGTCGAATCA ACGTTAGCAC CATATTTTAA AGAAATTGAA GAAACAGCAT 1260 

ATATTAATCA AGAAAAAGTA TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 1320

TACAAGGATC AACAGGATAC GGGTATGATG ACTTTGGACG TGATCATTTA G 1371 
(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6035 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
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CGTAAAcCTA TGCGTTTTAA TATTCTGAAG TTACTTAGTT CATCCTCAGT TTCATCCATT 120

TGTTTAATAT AAATAATACA TCCAGCTGCT ACTAAAAATG CTAATCCTAA AAATGATGTA 180

ACAAATATTA GAATACCGTT AGTAGCATcG ACCTCTTTTT TCATTTCATC ATACGTGATG 240

ACTTTGTCTC CAAACTGTTT TGCAATTGCT TGAGCTTTTT CCTTTTGTGA TGTTTGTTTA 300

ATATCATATC CATAAAAAGT ATGAACGTTA TTTTGTGTTT TCAACTGCTG ATACTTTTCA 360

GGACTTACTT CGATGACAGG TGAGTTGAAG CTTAGATTTA AAGGATAAAC CTTACCTTTG 420

TCTTCTTGTG TAcACGGAAA GTTTCATTCT TAGTACCTTT TACTACTAAA TCTTTGTTTA 480

AATGGATATT AATAATGTTA GGCAGCGATT TTGTATTTGT AATGATGGCA TTGTTGCCTG 540

TtAACTTGTT ATTTGCACTT AAAATAGAAT TCGTGCGACC TGAATCACTA CCATTTTCCA 600

AAGTAATAAC CTGATCATTA ACATTATCTA CAGTAATAGT TTCGTAAGCA TTTTTAGAAA 660

ATGTAATTTG TTGTTGGCTT AGTTTAGTTT CAAATTGTTT AGCATCTTGA GTAGCGACCA 720

CGTTAAATTC ATTTGGTGCC ATAGATGTAA GGGTTTGATC TGTATTTGAT TTAGATAATG 780

CCGCAAAACA CAATACAGTT ACTGTAACTG CAGAAATAAT TGCAATGATA GTTAAAGACA 840

TGGCATTTTT CTTCATTCTG TACATAATAG ACGATGTGAA TACAACATCG GTAATAGATA 900

CGCGTCCATT TTTTGATTTT TTCAATGTTT TAAAAATAAG TGACACGGAA CTTCTGAAGA 960

ATAAATAGGC GCCTACAACC GTTAAAAATA AAATGATAAA CGGTGATGTC ATAGCCATAG 1020

TTAGTGCTTT GAACGTACCA AACATTTCTG TCGCCATATA ATAGCCTAGT GCAATCATAG 1080

CAATACCTAA TACGCCTGAA ATAACCTCTG CAGTCGTTAC TTTAGCAGTT GTGGCATCAG 1140

TTTTAATTGA ATCTTTCATC ATTGATAAGA TACTACGTCT TTTTAGAAAT AAAGCACTTT 1200

GAAATAAAAT CAGTACATAC GCAATAATTA GCATGAAAAT AGTTAAAACA AGGGCCATAG 1260

GTTCGAAATG TATCGATAAG TTAATCGATA ACGACATCAA TTTAGATACT ATGGAAAGCA 1320

ATAaTTGTGC ACCTGCAATG CCACATAATA CACCGACAAC ACCTGTGATT AAAAATACGA 1380

TCATTtGTTC AAGTGCTAAC ATTTnCAAAA TGTTTTGTCG TGTTAAACCA ATCaACTGAA 1440

ATAGCGCAAA TTCACGTGTA CGGCGTTTTA CGrmTAAATG ATTGGCATAC ATTAAAAAGA 1500

TGACAATAAT GATAAATAAA AATATTGATC CGACTAAAGC ACCTTTCTTA ATGATGGCCA 1560

TCGAGTCGTC ATTATTTACA CCTTTAGTAA ACTGTAAGGT TGTAAAACTG AAATATAAGA 1620

CGATGCTAAA AAATAATGAA AATAAATACA TTGCATAATG TTTTAAGTTT TGTCGTAAGT 1680

TTTTGAAAAC GATATGGTTA AATGTCATTT GAGACACCAC CTAATACTGA TTGAAGATGT 1740

ACAATGTCTT CATAAAAGGC CTGTTTAGAA CGTCCTTCCT GATAAAGTTG TGTATGAATT 1800
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ACCATGACAA TAGTTGTATC AAACGATTTA TTCATTTCTT CCAAACGTTG TAATAGGTCA 1920 

TTTGCACTTT TCGAGTCGAG TGCGCCTGTT GGCTCATCTG CAAATATGAT TTGTGGTTTG 198 0 

TGAACAAATG CTCTCGCTGC TGCAGTTCTT TGTTGTTGAC CACCAGATAA TTCGCTAGGG 2040

TATTTATTTC CTAGGTCATA AATACCTAAT GCTGTCGTGA TCGCTTTATA ATTTTCTTCC 2100 

ATTGTTGCCT TCGACATTTT TTGAACAGAT AAAGGTAACA TAATGTTTTC TTTAACGGTT 216 0 

AATGTCGGCA GAATACTGTA ATCTTGGAAG ATGAAACCTA ATGATTCTTT GCGGAATTTG 2 220 

GCAAGTGCTT TTTGATTAAG TTTATTAAGC TCTTGTCCGT TAGCAATCcg CTACCGCTAG 2280 

AAATTTGGTC AATTGAACTT AGTACATTTA ATAAGGTTGT CTTACCTGAT CCAGAAGGCC 234 0 

CCATAATCGC AACGAATTCG C CTTTTTGT A TGTCAAAGTT AATATCTTTA AGTG CTTGAA 24 00 

ATGTGTGCTT TTTACCGTAT GTTTTTGAAA CATGTgCACT GATAATATCG TCATAGTCTC 24 6 0 

ACTCCTTtTG TATTTAATTT CATTTTAAAT AATGTTTGGA GTAGTAGCCT TTATCTAAAC 2520

TTACAATTCA ATGAATGAAC CTTACAGAGT TGAAArcTAT CGCTACTTAG TAGATTTTTG 2580 

AGTGAGGATA CAG ATT CATC GTACATATTA GACAAAAGCA ATGGTGCTTT CTAAGTGATG 2 64 0 

ATGTTTGTGT AAATTGAGAA AAGGGAATTT AATTATTGTA TAATAAATTT TTTGTAAAAA 2700

TTAAAAGAGG GTTTTATTTG AAAGGAATTG ATTGTTATGG AAAAAGGAAA TCAAGGTATT 2760 

AAATGGTCTA GTTTAATAAT GGGTGTATTA TTATTAATGT TGGCAGTCGT TATTTTTACA 2 820 

TTTCCAATTG AAAATTTTTA TGCTATTACC TGGTTGATTG GACTGTTTGT ATTAATTAAC 2880

GGTGTGATTC AAATCGTTTA CCGTAGAAAA GCAAAAGCTT TAGTAGGTGG TAACCAAAAT 2 94 0 

TGG ATTCTG T TTATGGGGAT TGTAGATATT CTATTTGGTC TATTAGTTAT TTTTAATGTT 3 000 

GGCGCAAGTT CAGCATTCTT TATTTATATG TTTGCTTTTT GGTTTATTTT TAGTTCTATC 3 060 

TCTGGATTAT TTACGTTTTC GGGTAGTGGT AGCTTAAAAC TAATTTCAGT GATTTTTAAT 3120 

TTATTAGGTA TTGTTTTCGG TGTCATTTTA TTATTTAATC CATTAATGGG TATCGTCTTT 3180 

ATTTCGACGA TGATTGCTAT TGCATTTGTA TTCGTAGGTG TCATTTATGT TGTAGATGCA 324 0 

CTTGCTTAAG TAAAATGAAG CGGTTCAAAA GAAGGGTGTG ACATGAAGTT TGTGTCATAT 3300 

CCTTTTTGTT GTGTTTATGA AGCATAAAAA AGGGGCGCTA CCTACAATAA GTAAGATACG 3360

CC CAT ATTTT TATATTTTAC TATTATTGTT TTTCAATACG ATTAATAGTT ACATTTAGTC 3 420 

CAAAATATTT TTCTAAAAAA TGTTTATAGT TATCTTTAGT GACATCAAAT TTTTCTGAGC 34 80 

TACCATTCCT TGTTAAAGTT AAATGATTTT CAGACATTGT AGCACGGCCA AATGATTGTG 3540

GCATTGTAAT TAATAAATGC TGTACAAATA TTGAATCTGG ATGCGTTTGA TTATATTCGA 3600 

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AATGATCATT TTCGAATTTT TGAACATAGA AAATATCCTT GTCTTCGTTG TTAAAAATAG 3720

CGCGGAATGT ACCACTGATA TCAGTAATTG GTTGTGTATG CTCAGATGAA GTAATAGGAA 3780

TGGCATGTAG AGGTAAGTCT CCAAAGCCAA CATCAGTTAC ATAGAATACA TCATTTATAG 3840

AAACAACAAG TGAAGCATGT GAACCGTTCA GACTACGACC GCCACCGGGw GTGTGAATAG 3900

TAGCTGACAT TAATTCAGGA TTAAATCCTT TTTGTTGTAA ATAGGCTTTG AAAAATGTAT 3960

TTAATTCATA ACAAAAACCA CCACGTTTAT CATGAACAAT TTTATTAAAA AGTGCATCGA 4020

TATTTATAGA TATCGGCTTA CTATTTTGAA CATCAATATT TTCAAAAGGT ACAGTTAACA 4080

TAAAACGTGT TGCATAATAA TTTAATGCTT CAATACTCGG TCGATTATAA CGAGATGAAT 4140

CAATTTGTAA ATAATTCTCT AACTTCGCAA TATTCATAAG CATAGCGCCT CCTGTATTAA 4200

AGATTATAAT TAAATTTTAA ACAGAAATAC TGAAATTTTA AATTCGAAAG CATTGAATTT 4260

TGGATAAATA CATTTTAAAT AGAAAAATAC GCTCTCAAAA TGAAGTCATC TCTAAAAGAA 4320

ACGATTTAAA GATGACTACT GAGAGCGTAg CATAATGGAA GAAGTGTGCA GGGTGTCTAA 4380

AAATGCAACA ATACAAAGGT AGTTGCAAGA CAAGTTGCCT TATCTAGACC ATTTGTGTTC 4440

TATGCGACCA AACTTCCAAA TTAAACTTGA AATAAGCCAA GTAATTAAAA ATAATGCAAC 4500

TAAAATATAG CCTAAATAAT CAAATTCGAT CGAACCAATG AATGCCCAAA ACGCACCATG 4560

TAAATCTAAC TTATCAGCAA GAATTTGTAG CAATTCAATC ATCCCAATCA CTAATGCTGC 4620

CATGACTGAT ATCGCAGTAA TCGTTATATT GTAATAGATT TTGCGAATAG GATTGAAGAA 4680

TGCCCAATTA TAGGCATACT TCATTACAAC ACCATCTAAT GTATCCAATA AACTCATACC 4740

TGATGCGAAT AAAATTGGTA AAGATAAGAT TCCGATAAAT GAAATGGCTT GTTGTGATGC 4800

GCCTGAAGAA AGAGCGAGTA ACGCAATTTC ACTAGCTGTA TCAAAACCAA GTCCAAATAA 4860

AAAGCCAAGT GGCAATACGT GCCAACTACG CGTGATTAAT TTGAAATAAG GTCCTACAAA 4920

TCGAGAAACC AATCCTCTAG ATTCAAGTAA TGCATCGACT TCAGCTTCTT CAATGTGTTC 4980

ACGACGTAAT TTAGCGAACA AGTTAATTAA AGAGATTAAA ATAATTAGAT TCAACACACC 5040

GATAAGCACT AAAAAGAAAC CTGAAACTAG TGTACCAATC GTTCCACCAA TATCTTGGAA 5100

ATGCGGTAAT TCATCTTTAG CCCATTTTAC AGATACCCCT AAAAAAACAG CCATTAAAAA 5160

TACGACAGAT GAATGTCCAA TTGAAAAATA GAAACCCACA CCAGATGGAT CTTTGCGTTG 5220

CTGTAATAAT TTGCGaACCG TATTATCTAT TGCAGCAATG TGATCTGCAT CAAATGCATG 5280

ACGCAAACCT AATGTATATG CAAGAATCCC CaTACCAAAT AAGATATGAT GGTCTTTTCC 5340

AGCAATCCAT AAAAAACTAA ACCCAATAAC GTGTAACAAA ATGACAATAG CTATGTATGG 5400
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ATATTTAATC ATACTGTATG TTCAATGGGC ACTCTAGTAA TAAGTGTTCA TATAACAAAA 552 0 

ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT AACCACAAAA GATTTTTTGC 558 0 

GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG ATGTCACTTT GCTTAACTTG 5640

GCATCCTTTT TATGATTTTC AAATTCAAAA AAATGAGCAA AATGAATCTC TTTAcCAGTT 5700 

TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT GTGTGATGCT GGAATGGATA 5760 

TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT GAATTAATAT TTGAGGTCCG 582 0 

TATATGTCAA AGCGGCCAGG GACAGACCAA ATAAATTCTG TTGTAACCAG TGAACGTAAT 58 8 0 

AATTCAATAT CTAAT GCTGC TGTAACAACT ATAAAATCTA TCATTTGTTG ACGTTTAGGC 5940 

GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA ATGATGACTG AACTTCCGTT 6000 

TTAAAATGTA GTTGGTGCTG AAATAAAGCT TGTTC 603 5 
(2) INFORMATION FOR SEQ ID NO: 273:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:

TTTTGAACAG CCATATTTAT TCACCCTCAA CATCATTATA ATGGTATTAG TCGCATTACC 6 0 

TTCACTTGTT TTAGCTATAT ATGATTATAT GAGTTTTAGA ATTTCTTCTG CTATTTTACA 120 

ATTTCTAGGG GCTATCTCTT GGTTCTTTTT ATCATTGATA TTATCGCTCA CACAATTTAC 180 

ACCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG TTCACAAGCA CAATTATCAC 24 0 

ATT AG CCA TT GGTGGTAAGT CTGTTGAAAA GAATGATTCC CCTTAAATTC CAAATGAAAA 3 00 

AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT TCTATTCAAT 3 60 

GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG AAACTTTGTT 42 0 

AAAATAAACC ATCAAACAAC GAATGACAAC TTGATGTGCA ACAATGACAA TAT CATCTTT 4 80 

TTGTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT ATACATCTTC 540

ATAACTTTCT CCTTCAGGCG CTTTTTGTGA AAAACTATGA CGAAAGTCTT TAAAGTTTGG 6 00 

ATCATTGAAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT ATTCACCCTC 66 0 

AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGTAGTC GTTGATGCAA TGTCATATGG 720

AAAAATATGT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA CATGTTTAAT 780 




CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCGATTT 900

TGATTCGCCA TGTCTAACTA AATAAATCGT CATAATATTA CTCCTTACCT TATGTATTTC 960

ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TATTCTAAAA 1020

AAAGAATTAA TTCATATnT 1039




(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

GAGAGAATnT GCAATTAGTT ATTCAATTAG TTGATTTAAG ACATGATCCA ACACAAGATG 60 

ATATCTTAAT GTACAATTAT TTGAAACATT TTGATATTCC TACTTTAGTT ATATGCACTA 120 

ArGAaGACAA AATTCCaAAA GGTAAGGTyC AAAAG CAT AT TAAAAATATT AAGACACAAT 18 0 

TAGATATGGA CCCAGACGAT ACAATTGTAA GTT ATT CATC AATTCAAAAT AATAmaCAAC 24 0 

AACAAATATG GAATTTAATT GAAC CGTATA TIT CAT AG TT TTTGTACGTC AAAACTTATA 3 00 

CAAAAATTTT AAAAATAATG TAAGCACGAA ACTTTTAATT AGTACACAAT TGATAACATT 3 60 

TTTCAACGTT CATCATTTTG TCAAAAACTC AAAAGTAAAT TAGAAAGATT ATAATTTATT 4 20 

TAAGCATCGT ACTTAATTGG ATTTTAAATT ATGTTATAAT ATTTGTATTG TTAGTATATA 4 80 

TGGGGGCTTT TCAAATGCAT TTTATTGCAA TTAGTATAAA TCATCGCACA GCTGATGTgC 54 0 

ACTAAGAGAG CAAGTTACTT TTAGAGATGA TGCCTTACGA ATTGCCCATG AAGATTTATA 600 

TGAAACTAAA TCTAyTTTAG AAAATGgTCA TATTaTCAAC ATGTAATCGA ACTGAAGTAT 660 

ATGCTGTTGT TG AT CAAATT CACACAGGTC GTTACTATAT TCAACGATTT CTAGCTCGTG 720 

CATTTGGATT TGAAGTAGAT GATATTAAAG CAATGTCAGA AGTAAAAGTG GGGGACGAAG 780 

CaGTAGAACA TTTATTGCGT GTCACTTCTG GTTTAGATTC AATCGTACTT GGAGAAACTC 84 0 

AAATTTTAGG TCAAATAAGA GATGCATTTT TCTTAGCGCA AAGCACAGGT ACGACAGGrA 900 

CAATTTTTAA TCATCTATTT AAACAGGCAA TTACTTTTGC AAAAAGAGCA CATAATGAAA 960 

CAGATATAGC TGATAATGCT GTAAGTGTGT CTTATGCTGC GGTCGAGTTG GCGAAAAAAG 102 0 

TATTTGGCAA ATTG AAAAG T AAGCAAGCTA TCATTATTGG TGCAGGGGAA ATGAGTGAAT 1080 

TATCACTATT AAATCTTCTT GGTTCTGGAA TTACTGaTAT TACAGTAGTA AATAGAACAA 114 0 
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TACCAAATTT ACTTGAAAGT GCAGATATTG TGATTAGTTC AACGAGTGCA CAATCTTATA 12 6 0 

TCATTACAAA TGAAATGATA GAAAGAATTG CAGAAAATAG AAAGCAAGAT TCACTAGTAT 13 20 

TGATTGATAT TGCAGTTCCT CGAGATATTG AACCTGGTAT TAGTGCCATC ACAAACATCT 13 80 

TTAATTATGA TGTTGATGAC TTAAAAGGTT TAGTTGATGC AAACTTACGT GAGCGACAAT 14 4 0 

TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT ACATGCACAC AATGAG 14 96 



(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:

CTTGATTTTT TCCCtTTAGT ATTTTCCaTt TGanTGTCGC AGCTT CTAAA TCCTGCTTTG 60 

GTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA TGATACAAGT GGAGCAAGGA 120

AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAGGTGC ATTTAAACAG TAAACTATAA 180

ATGAACCTGG TGACGCGGGA CTTTTAAATC CAAATCCTGT TGCTTGATAA GTTGCAACAC 24 0 

CAGTCATTCC ACCTAAAATA ACAGCGATAA ATAATAAAGG ACGCATTAAT ACATATGGGA 3 00 

AATAAATTTC ATGAATACCA CCTAAGAAGT GGATAATTCC AGCACCATAT GACGTTGCTT 360

TTGCAGTGCC TTTTCCAAAA ATCATATAAG CAAGTAAGAT ACCTAAACCT GGTCCAGGGT 42 0 

TAGATTCAAT TGTGTATAAA ATTGATTGAC CAGCTTTTGC AGCTTGATCT GCACCAAGCG 480 

GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC AATTTTTGCA GGCTCTACTA 54 0 

AAATACTTAC AAGTGGAAGT AGGTGTGCAT GTACTAATGC TTCAACTGCC ACTGATAAAA 60 0 

TATGCATAAT AAATTTCATA AGTGGTGCTA AAATTTTAAA TCCTGCAATC GTCATGATAA 6 60 

ATCCTAAAAT ACCAGCAGAA AAGTTATTAA ATAACATTTC AAAACCTTGC GGCGTTCTAG 720 

GTTGAATCAA TTGGTCGGTC TTCTTCATTA ACCAACCAAC AAGTGGACCC ATAATCATTG 780 

CACCAAGTAA CATTGGTGTA TCAGGTAATG CAACGATGAC CCCCATAGTT GCTGTTGCTG 840

CGATGATACC ACCACGTAAA TCATAAATTA AACGACCACC ACTAAATGCG ATCAATAATG 900 

GGATTAAATA AGTAATCATT GGTCCTGCTA AAGTAGCTAA ATCTTTGTTA GGTAACCATC 960 

CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT GAAAGCGCCA ATGTTTGGCA 1020

TGATCATACT ACTTAAGAAT GATCCAAATG CTTGAACACG ACGACCAATT CCTTTTTTCT 1080 
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GAGAGGTTAC TTGTTACTCA ATATAAACAA AAATCAACTT TGTCAAAATA AATGTGACAA 12 0 0 

AATTAAATAA AGTGTCATCA ATGTGACAGT ATAGATATTT TGAAAAAGTA AAACAAAAAA 12 60 

ATTGTTTTAG GATTTTTAAA ATTTTATTGT GAAAATATTT GCAAAACAAA ACAACACCGT 1320

gTACAATAAT GATTAATGGA AAGGGGGAAA GTTCGGCAGT ACAGTTAAAG CGCCTGTGCA 13 8 0 

AATAAATATT TGTATTTGAA GATTAAAGGT TAATATATGA GTGGCCTTTA TAGAGTGCAA 144 0 

TATATGTATT TGTAGACGAG GAGGATAGTG ATCGAATAGA TCGGCGGATG CTATCCCGGA 1500 

TGTGGCTCAT TCGTTAGCTT ATTAAGTAAA ACATTAGGGT GACTTAATGG ACAAAGTTAA 156 0 

TAAGATCGCC AGAAATTGAA TATAAAAAAT ATTAATATGG AAAGTACAGT GTGAGCAATT 162 0 

TGTATAGTTG TAAAAATAAC TATGCTTAAT TTGTTAT GGA TGAATGCGAT GAT AG CATGT 16 80 

TCCTATTTAT ATTATGAAAG CAGATTGTCA ATCTAAATTA T CGGC AATAA ATCATAATTT 174 0 

ACGCGTACTA TTCCAATATG GAGGAAAATG TCGTTATGTG TGGAATTGTT GGTTATATTG 1800

GCTATGaTAA TGCCAAAGAA TTATTATTAA AAGGTTTAGA AAAATTAGAA TACAGAGGTT 186 0 

ATGACTCTGC AGGTATCGCA GTAGTAAATG ATGATAATAC AACTGTATTT AAAGAAAAAG 192 0 

GTCGTATTGC AGAATTACGT AAAGTTGCTG ATAGTAGCGA TTTTGATGGA CCTGTTGGAA 1960

TCGGTCACAC ACGTTGGGCA ACAGACGGTG TACCGAATCA TGAAAACTCT CATCCACATC 2 04 0 

AAT CAT C AAA TGGCCGTTTT ACTCTAGTTC ATAACGGTGT TATTGAAAAC TATGAAGAGT 2100 

TAAAAGGTGA ATACTTACAA GGTGTATCAT TCATTTCAGA AACAGATACA GAAGTTATCG 2160

TTCAATTAGT TGAATACTTT TCAAATCAAG GACTTTCAAC TGAAGAAGCA TTTACAAAAG 2220 

TTGTGTCATT ATTACATGGT TC AT A TG CAT TAGGTTTATT AGATGCTGAA GACAAAG A CA 22 8 0 

CAATCTATGT TGCTAAAAAT AAATCACCAT TATTATTAGG TGTTGGTGAA GGTTTCAATG 2 34 0 

TTATCGCATC AGACGCACTT GCAATGTTAC AAGTGACAAG CGAATATAAA GAAATCCATG 24 0 0 

ACCATGAAAT CGTTATTGTT AAAAAAGATG AAGTTATTAT TAAAGATGCA GATGGAAACG 24 6 0 

TTGTAGAACG TGATTCATAT A TTG C TG AAA TTGATGCATC AGATGCTGAA AAAGGTGTTT 252 0 

ATGCACACTA CATGTTAAAA GAAATTCATG AACAACCAGC AGTAATGCGT CGTATTATTC 2580 

AAGAATATCA AGATGCAGAA GGTAACTTGA AAATTGATCA AGACAT CATC AATGATGTTA 2640

AAGAAGCAGA CCGCATTTAC GTTATTGCAG CAGGTACAAG CTACCATGCA GGTTTAGTAG 2700 

GTAAAGAATT TTTAGAAAAA TGGGCTGGCG TACCAACTGA AGTACACGTT GCATCAGAGT 276 0 

TTGTCTACAA CATGCCATTA TTATCTGAAA AACCATTGTT CGTTTATATT TCTCAATCAG 2820

GTGAAACTGC AGATAGCCGC GCCGTATTAG TTGAAACTAA TAAATTAGGT CATAAATCAT 288 0 
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TACACGCGGG TCCTGAAATC GCAGTTGCAT CTACAAAAGC ATATACTGCA CAAATTGCAG 3 000 

TATTATCAAT CTTGTCTCAA ATCGTTGCAA AAGAGCaTGG TCGTGAAGCA GATATTGATT 3 06 0 

TATTGAGAGA ATT AG CAAAA GTAACAACAG CAATAGAAGC AATTGTTGAC GATGCACCAA 312 0 

TTATGGAACA AATTGCTACA GATTTCTTAG AAACAACACG CAATGCATTC TTTATCGGAC 3180 

GTACTATTGA CTATAACGTA AGTTTAGAAG GTGCGTTAAA ACTTAAAGAA ATTTCTTACA 324 0 

TTCaAGCAGA AGGTTTTGCT GGTGGAGAAC TTAAACATGG TACAATTGCC TTAATCGAAG 33 00 

AAGGTACACC AG TTGTAGG T TTAGCAACAC AAGAGAAAGT TAATTTATCA ATTCGTGGTA 3360 

ACGTTAAAGA GGTAGTAGCA CGTGGTGCAC ATCCATGTAT TATTTCTATG GAGGGTCTTG 34 2 0 

AAAAAGAAGG CGACACTTAT GTCATTCCTC ATGTACATGA ATTGTTAACG CCATTAGTAT 34 80 

CAGTGGTTGC ATTACAATTA ATTTCATACT ATGCAGCATT ACACAGAGAT TTAGATGTTG 3540 

ATAAACCACG TAACCTTGCT AAATCAGTTA CTGTGGAATA ATTCACTTTT TTAGAATCAA 3600 

TCATGTATTA AAATTAAAGT ATATGGCACC CTTTTAGATT AATCGACTAG AAGGGTGCTT 36 60 

TTTTAGGTCG ACTTaGCTTT TACTTCATCT TAATTTGGCA GAAATGCGTa AAAATGAAGT 3720 

GTTTTATTTA TTTAAATAGT CTGACAATTA AGGGTGTTAT GTTAATATGA TTTTATGAGA 37 80 

AGTATGGAGT AGCAATAAAG GGGTGACCTC GCATGTTAAT TCAATTAGAT CAAATTGGGC 3 84 0 

GAATGAAGCA AGGAAAAACA ATTTTAAAAA AGATTTCTTG GCAAATTGCT AAAGGTGATA 3 900 

AATGGATATT ATATGGGTTG AATGGTGCTG GCAAGACAAC ACTTCTAAAT ATTTTAAATG 3 960 

CGTATGAGCC TGCAACATCT GGAACTGTTA ACCTTTTCGG TAAAATGCCA GGCAAGGTAG 4020

GGTATTCTGC AGAGACTGTA CGACAACATA TAGGTTTTGT ATCTCATAGT TTACTGGAAA 4 0 80 

AGTTTCAAGA GGGTGAAAGA GTAATCGATG TGGTGATAAG CGGTGCCTTT AAATCAATTG 414 0 

GTGTCTATCA AGATATTGAT GATGAGATAC GTAATGAAGC ACATCAATTA CTTAAATTAG 4200 

TTGGAATGTC TGCTAAAGCG CAACAATATA TTGGTTATTT ATCTACCGGT GAAAAACAAC 4260 

GAGTGATGAT TGCACGAGCT TTAATGGGGC AACCCCAGGT TTTAATTTTA GATGAGCCAG 4 32 0 

CAGCTGGTTT AGACTTTATT GCACGAGAAT CGTTGTTAAG TATACTTGAC TCATTGTCAG 43 80 

ATTCATATCC AACGCTTGCG ATGATTTATG TGACGCACTT TATTGAAGAA ATAACTGCTA 444 0 

ACTTTTCCAA AATTTTACTG CTAAAAGATG GCCAAAGTAT TCAACAAGGC GCTGTAGAAG 45 0 0 

ACATATTAAC TTCTGAAAAC ATGTCACGAT TTTTCCAGAA AAATGTAGCA GTTCAAAGAT 4 56 0 

GGAATAATCG ATTTTCTATG GCAATGTTAG AGTAAATATT TTGCAAATAA TAAGTAATAA 462 0 

TGACAAAATT TAATTAAGAT AAAATGGACA GTGGAGGGCA ATATGGATAA CGTAAAAGCA 46 80 



EP 0 786 519 A2



AAAGATGTCA TTAATCAATT GAGAGAGAAA GGATATAAAG TATTTTTGGC AACAGGACGT 4 80 0 

TCGCATTCTG AAAATACATC AACTTG 4 826 
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4846 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 

TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 120 

TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 24 0 

TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 300 

ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 3 60 

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 4 20 

TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 4 80 

GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 54 0 

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 

TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 660 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 84 0 

TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900 

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960 

CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 1020 

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 114 0 

GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTTA 1200 

A TAT CT AG IT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 1260 
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TTACCGCAAG CTGCTAATAC AACTGCAAAT GTTAATACTA AAATAAGACC AAATAATTTT 138 0 

TTCATAAAAT GAAACCCCCA ATTTATCGTT TATCAAGTTT ATTTGTAAGC CAATCCCCAA 144 0 

TGAATTGGAT TATAAATACA ATAATTAAAA TAAAAACTGT TGATACTAAA ATGACATCAT 1500

TTTGATTTCG AGTGAAACCT GTTAAGTATG CTAAATTTCC TAAACCACCG GCACCAATTA 156 0 

CACCTGCAAC TGCTGTTGAA CCAACTAAAG CGATTGCTGT AACTGTAATG CCAGACACTA 162 0 

GCGCTGGCAT AGCTTCAGGT AAAAGGACTT TACGAATTAC TGTCCAAGTA TTAGCGCCCA 168 0 

TTGACCAAGC CGCTTCGATG ACACCTTTAT CAATTTCTTT AAAAGCAATT TCTACGAGCC 174 0 

TTGCATAAAA CGGTGctGCG CCAATGATCA AGGCTGGTAA CGCACCTGTC GGACCACTTA 180 0 

TCGTTCCAAG TATCAAACTT GTAAATGGAA TTAATAATAA AATTAAAATA ATAAATGGTA 1860 

TCGCTCTAAA TAAGTTAACA ATGAAAGAAA CGATAGAATA AAATAACCTT GCACCGATAG 192 0 

ACTTACCTTT AGCAGACAAG AATAATAACA CACCTAAAAT AAGACCAAGT ATAAATGCAA 1980

ATATAGTTGA GACGACTGTC ATGTATAGTG TTTCGACTAT TGCAGTCCAA ACTTCTGGCC 204 0 

ACTGAATATT AGGCATTGTA ATCATTTCAT TTATAATTTC ACTAAATGAT TTACCCATGT 210 0 

CTTAACACCT CCATTTTAAC TTGTCGCTCA ATTAACTCTT TTTCGAATTT TCCGAAATCT 2160

ACACTTGAAA TATATGGAAT ATGCAGAACT AAAAAGCCGA CTGTTCCATT TTTTGTATTT 222 0 

TTAATATTTG CTTCTAAAAT ATTAATTTTA ATATCATAGG CAGTTGATAG ACTCGATACA 22 8 0 

ATAGGCTCGG TTGTTGTTGA ACCAGCGAAA ACTAATCTAA CGATATATGC ATCTTTTTCT 2340

AATGGCTCTA ATTCTGTTAA AGATGTTTCG AAATCATCAT TTAAATCGTC TTTCACAAAT 24 0 0 

CGTTTTGTCA CAGTGTGTTG CGGATTTTCA AAAACCTGTG TCACCGGTCC TTGTTCTATC 246 0 

ACTTTACCAC TTTCCATAAC TGCAACTTCA TCACAAATAC GACGAATGAC ATGCATTTCA 2520

TGCGTAATTA GTACAATTGT TAAATTTTGT TGTTCTCTAA TTTTTAGTAG TAGATCTAAA 258 0 

ATTTCATCTG TTGTTTGCGG ATCAAGTGCA CTTGTTGCCT CATCACAAAG CAAGACCGTT 264 0 

GGATCATTaG TAACGCTCGT GCAATCCCAA CACGTTGCTT TTGTCCACCT GATAACTCTG 270 0 

ATGGATAAGC CTTTTCTCTA CCTTTTAAAC CGACGAGTTC GACAAGTTCT AATGCTTTTT 2760 

GCTTAGCTCT CCTTCTAGGG ACACCTGCAA TTTCAAGCGG AAACATAATA TTTTTTAACA 282 0 

CAGTCCTTGA CCATAACAAA TTAAAATGTT GGAAGATCAT ACTTACTTTT TGTCTTTTTG 2880 

CTCTTAATCC ATTTTTGGAC AATTGACCTA TATGGTCTCC ATCTATAATA ACTTCACCTG 2 94 0 

ATGTAGGCGC TTCTAAATGA TTAAACATTC GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 3000

AACCAATGAC GCCATAAATC GATCCTGCTC GAATCGATAA ATTAACGTGA TCTACAGCAA 3 060 
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TTCCTCCCTG TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AGAAAAATTG 3180

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTTTA TGTGAATTGG 3240

CACCATTTCT ATATAAGACG GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 3300

ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 3360

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 3420

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 3480

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 3540

CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 3600

TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 3720

GTAAAGCCAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 3780

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 3840

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 3900

GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 3960

ACTGGTTC7G CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 4020

ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 4080

TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 4140

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 4200

TAAGTCAAAA ATCAACATCC TAAAATCTAC AATGTTATAT TAACAATAGT TAACCAAAAG 4260

AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 4320

AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4380

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAATACAC 4440

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4500

TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 4560

GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 4620

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 4680

GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4740

AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 4800

TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 4846
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 60 

GATACACCAA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC CGAAATCGTC AGTCACTCAT 120 

AACGCAGAGA CACCTAAGGT TAGAAAAGCT CGTTCTGTTG ATGAAGGCTC TTTTGATATT 180

ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAAGAACAT 24 0 

TTTGAAGGTT ACGGAAGTGT TGATATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 3 00 

GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 360 

AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA TAACCATCAA 420 

TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 4 80 

TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 54 0 

ATTGATACTG GATACATTTA TACAAGTTCC ATGGACAAAA CTGAAAAGCA AGCTGGACAA 6 00 

GGTTATAGAG GATACGGAGC TTTTGTGAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 660

GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 720 

ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 780 

TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 840

TTAGGTTTAT CTAAAAATCA GGCATATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 90 0 

CTTAATCAAG GGATAAATGC AAATGGCTGG ATGAGAACTG ACTTGAAAGG TTCAGAGTTT 96 0 

ACTTTTACAC CAGAAGCGCC AAAAACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 102 0 

CCATTCAAGA AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAGTAACA 1080 

AGAGAAGGAC AAAAAGGTGA GAAGACAATA ACGACrCCAA CACTAAAAAA TCCATTAACT 114 0 

GGAGwAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATCC GATTAATGAA 1200 

TTAACAGAAT ACGGACCAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 1260 

TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 1320

GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAGGAGAC 1380

TCGATTGTAG AAAAAGAAGA rATTCCATTC rAGAAAGAAC GTAAATTTAA TCCTGATTTA 144 0 
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CCAACACTAA AAAATCCATT AACTGGAGAA AT TATTAG T A AAGGTGAATC GAAAGAAGAA 156 0 

ATCACAAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 1620 

CATCGAGACG AATTTGATCC GAAGTTACCA ACAGGAGAGA AAGAGGAAGT TCCAGGTAAA 16 80 

CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 174 0 

AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTCaGGAA 18 00 

AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAC 184 3 
(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8536 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:

TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 60 

CTTTTCGCGA TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 120

GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 180 

TACAATTATG TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 24 0 

CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 3 00 

ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC ATAGTCGACG GCTACAATGG 3 60 

CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 42 0 

AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAGCAGT 4 80 

GGCAGTTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT ACAAGCATTC GCAAAGTAAA 54 0 

ACATGAATAC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATGAAGTT CATTCGACCA 600

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCCCGCCAT 660 

AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATG TAATAGGaAT TCATTATATA 720 

TTTATATACG TAAGACTTTA GTGAATATAT CTATAATTAT TTACTTGGTA AGCTGGTACC 780

GTTCTGTAAG TTAAAATACC TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 840

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900 

GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 960

TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 1020 
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GTATTGTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT 1140

ACTTTCAATT TTTGACCCGG ATATATAAAG AAATTATTTA AGCCATTTAA GCTCATAATT 1200

TTTTGATAAG TTGTACCATA TTTTGATGCA ATTAATGACA ATGAGTCACC TGCTTGTACT 1260

GTATAGTATG ATCCGCCACC TGAGTTCGTT GATGGACGGC TACTATTGCT CGCAGCGTTA 1320

CTTGAGCTAG CAGTACCTGA TACTTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 1380

AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ATTTTGATGC GATTAATGAT 1440

AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG 1500

CTACTATTAC TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGGTT TGGGAAAATT 1560

AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 1620

GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 1680

GTTGCCGCAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTGCAATTAC TTTTTTTTGC 1740

ACTTTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 1800

CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA 1860

CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 1920

TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 1980

ACAATCCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA 2040

AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 2100

CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCCTGCAA 2160

CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC 2220

CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 2280

TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCGTT 2340

ATTAATAACT GCAATATCGA CATCAGATAA AGCACGTGCA GTTTGTTGTG CATCTACTGC 2400

AGTAATTTTT AAATGTTTTG GATTTGACGT TATATCTTTC ACCGTGCCTG CTAATCCGAA 2460

ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 2520

TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTTA ACCTTTTTGA CATCTTTAAT 2580

TTTATCTGAG TAAATGCCCA ACGGTGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 2640

TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 2700

AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 2760

CACATCAATA TCATCTTTTT TAGCTAATTC TTTAACCTTC TCCCAAGCCT TAGTGTCATT 2820
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AAGTGCTACG ATAACTAACC CAATCAATCT TTTCATTCTA TCAATTCCTT TCAAAATCTT 294 0 

CACTATATAT CATTAATGTC TACGTATGAA TCTAGCTAGA ACATTCCCTA GCGTTTGAAT 300 0 

CACTTGGACA ATAATGACTA ATACAATAAC GGTAATAATA ATGACCGTCG TATCAAATCT 3 06 0 

TTGATAACCA TACACTAAAG CTAAGTCTCC TATACCACCA CCGCCAACAG CTCCTGCCAT 312 0 

CGCCGTACTT CCAATAAGTC CAATAATCGC AGTGGTAATT GCTAATACTA ACGAACCTAA 3180 

AGCTTCAGGA ATTAAAAAAT ATCTAATGAT TTGTAGTGGT GAAGCGCCCA TCGnTTTCGC 324 0 

CGCTTCAATA ATCCCCTCGT CTACTTCCAA TAATGAGTTT TCAACAAGTC TTGCAATGTA 3 30 0 

AGGTGCCACA TATACTGTTA AAGGCACGAT GGCAGCAGTC GTACCAATTG AAGTACCTAC 3360

TACTAATTTT GTGAATGGCA CAATCGCAAT TAACAAAATA ATAAATGGTA GTGACCTTAA 34 2 0 

AATATTGATT AAAGGATTTA AAACTTGATG TATCACTATA TTGGGCCATA TGCCTTGTTT 34 80 

TCGAGTAATT ACCAATAAGA CACCTAATGG AATACCAATC ACTGCTCCTA AAAATAAAGC 354 0 

AATAGATACC ATATATAGCG TTTCGTACAA TGCTTGTAAT AACTGTGCAC TGTCTAAATC 3600

AGAACCAAAC ATATGTTAAT GcACCTCCTC AAATTGAATA TTTTTCTCTT TGAAATATTG 3660 

ATTTATTGCC GTGTCTTCAA ATTGTTGATC CATATTAAAT CGAAGCCACA TATAACATAC 372 0 

GGTGTTACCT TGTATTTCTG ACATAGATGA AAATAAAATT TTAACCTCTC TGCCACAAAT 378 0 

TTGAATCAAG TCATTTATAA TCGGTTGTGT CACCTGAGTT TCCTCGACGA AGATTTTATA 3 84 0 

ATCTTTAAAA TCGCCAACTT GTTCGTCATT CAATCGACGA ATCAATGATG TACTTGGCTC 3900

AGTCTGTATA ACTGTAGACA CAAAATTTTG AGCAATCGTC GTTTTAGGAT GACTAAACAC 3 960 

CTCTTTAAcA GTTCCTGTTT CAACCACTTT CCCCTTTTCC ATTACAGCAA CACGATTACA 4 020 

AATGTCTTTA ATAACGCGCA TTTCATGTGT AATCATCATA ATTGTAATGC CAAAGGTTTG 4 0 80 

ATTGfiCATTC TTTAATAACG TCAATATCGA AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 414 0 

TGCTTCATCG CATAGGAGTA TTTTCGGATT AGTAACAAGC GCTCTTGCAA TAGCCACCCT 4200 

TTGCTTCTGC CCACCAGATA ATTCATCAGG AAATTGGTCT TTTTTATCAC TCAATCCTAC 4 260 

AAATTCAAGC ATTTCCGTTA CTCGTTGCTT AATTTCTGTT TTGCTTTTCT TACTTAAAAT 4320 

GAGTGGCATT GCTACATTTT TAAATACGGT AGCTGAATTT AATAAATTGA AATGCTGAAA 43 80 

TATCATACCG ATATCTTTCT TAATATCCCT CATCATTTTA TCGCTATAAT TCGTAATATC 4440 

ATGTCCATCT ACAATCACTT GTCCATTCGA GGCAGCTTCA AGATGATTCA CGAGTCTTAC 4 500 

CAACGTACTT TTTCCTGCAC CACTATATCC AATCACACCA AAAATATCAT TGCGATTGAC 4 560 

CGTAAATGAT ACGTCCTTCA AAGCATCTAT TTTTTGCTTC TTTTTATTAA AGGTCTTACT 4 620 
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TAAGAGCATT ATATGTAAAA TTGCATATAT CGTcAATACA ATTTGCCGAA TTTTCTAAAA 4 74 0 

AATTAAAAAA TAAGTAATTC ATGTGACAAT GACGAATTGT GAGACTACTA TGACATTTAT 4 800 

CAAATTAAAT CCATAAAAAT GTCCACCAAT CCTCCACAAC GCAATTACTA AATATTAACA 4 86 0 

TCGCACAAAA AAGCACTAGC ATATTCAAGA ACAACAAACG TTGAACTCAA AATATATGCC 4 92 0 

AGTGCTGCTA TTATTTATAA AGTATCTAGT GCTTGTTTTA AATCATCGAC TAAATCTTCA 4 98 0 

GTATCTTCAA TACCTACAGA AATTCTTACA AGTCCGTCTG TAATACCTTC TTTAGCTCGA 5 04 0 

ATATCTGCTG GAATGGATGC ATGTGTCATC AATGCAGGTA CTGAAATTAA ACTTTCCACT 5100 

GCACCTAAAC TTTCAGCTAA TGTGTAATAC GATGTTGCTT TAATCAATTG TTTGGCACTT 5160 

TCTGTATTTT TCACTTCAAA TGCAATCACA CCTGTATGGC CATCCGCTTG AGCCATATGG 5220 

ACATCATGAT TTAAATGACT TTCAATACTT GGATGGAACA CTTGTTGCAC AGCTGGATGT 52 8 0 

GCTTGTAACA TTTTAATAAT TTCAATAACG CTGCGATTAA TTTGTTCCAT ACGTAAACCT 5340

AATGTTTTAA TACCCCTCAC AAGTAAATAG CTATCTTGAG GTCCTAAAAT GCCACCTGTT 54 00 

GAATTTGAAA TAAATGCTAA ACGTTCTGCA AGCTTGTCAT CCGATGTTGC AACTAAACCA 54 6 0 

GCAACGACAT CACTATGTCC ACCTAAATAT TTCGTTGCAG AATGTAAGAC AATATCGATA 5520

CCTAAATCTA ATGGATTCTG ATAATAAGGT GTCATAAATG TGTTATCAAC AACTGAAATC 558 0 

AAACCGTGTT CTTTCGCAAT TTCAGCAGAC TTTTTAATGT CAGTAACACG TAATAATGGA 5 64 0 

TTAGAAGGTG TTTCAATAAA CAACATCTTT GTTGTTGGGC GTATCGCTTG TACAATTGAA 5 700 

TCTGTATGCG TTGTATCTAC AAAATCCACT TCAATGCCAA ATCGTGTAAA TACTTTTGTC 576 0 

AATGCGCGAT AAGTACCGCC GTATACATCT GAATTTAAAA TAATATGATC TCCTTTGTCC 5 820 

AACAGCATAA CAACTGCACT GATTGCTGCA ACACCTGAAC TAAATGCAAA GCCATGTTTG 5 88 0 

CCATTTTCTA ATGTCGCAAT AACGCTTTCT ACAGAACTTC TTGTTGGATT CGCAGTACGA 5 94 0 

GAATATTCAT ATCCTTGACG TAAATCACCA ATATCATCTT GTAAATATGT ACTTGTTTGA 6000

TAAATTGGTG TTGTAACGGC ACCTGTATAA TCGTCTGTTG TGTGCCCACC ATGAATTAAT 6 06 0 

TTAGTTTTCT TGTTCATTAT TATTCTCCTC ATAATTAAAT ATTTGCTTAG ACATATATCG 6120 

ATCACTACCA TCTGGAAATA CGACAACAAT CGTACCTTCA GATAATTGCG CTTTTAAATT 6180

CAATGCACCT TGTAATGCTG CACCTGaAGA ACTGCCTACT AACAACCCTT CATTTATAGC 6 24 0 

CAAACTTTTG ACATTTCGAA AGGCATCTTG ATCTTTAATC GTAAATATCC CATCTACAAG 6 3 00 

ACGTCTCTCT AAAAATATCG GCCATTTCTC AGAACCGATA CCTTCAGTGT CATGTGCATG 6 3 60 

AGCTGGCCCT CCATTTAACA CGGACCCTTC TGGCTCAACG GCATAACATT GCACGTGATG 6420 
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ATAATCAATT TGCTGTAATG CTGAAGTCAA TTCGGGTCCC AATGTATGAA AATATGTATC 6 54 0 

CGGATTATGT TCGGATTCAA ATTGATTCAT ATAAACGGCA CCATATTTTT CAGCATAGGA 6600 

ACGTGCAGCT AATTGTGCCC CATGCATACC TTCAGACTGA CTCGTCCTTG AAACTTCTGC 6660 

ACCAAGCGCT ATCATAATAT TAATCTTTTC TTCTGAAAAA CCATACGGCG CAAAGATCTT 672 0 

ACATTTCAAA TGATGTCTAT TCGCTGCAAT AGCTAACCCT ATGCCTGTAT TACCAGCAGT 678 0 

CGCTTCAACA ATAGTTTGAC CTGCACGCAC ACGCCCTTCT TGAATTGCCT TCTCTACTAA 6 84 0 

ATATTTCCCG AGTCTGTCTT TAACACTGCC TCCAGGATTC CATTGTTCAA GCTTGGCATA 6 900 

AATTTTAACT TTATCATCAC TATAATGTTC TAACAGTACT AATGGTGTAT TGCCaATTAA 6960 

ATCATAAGTA ATCATAGATG CACCCTCATC TGACATGCCG ATCAAATGAA TGAAACCTTT 7020

CTTCATGTCT CAATTTTAAT TCTTACTTTT CAGATAAGAA TTATAAACGA CATTTTGTTA 7 0 80 

TTTTGCAATT ATCTAAGTTT CGATTAATTC AGAACCAGTA CTAAATTTTC AATTCCAAAC 7140

AAAAAAACAC CTGAGCAACA CAAATACTTG TGTGTCAGAT GCTTCTATAT ATTAACTAAA 7200

TAATTGCACG ATAAAGACTA AAATAATAAC GACAGGCATC GCATACTTAA TTAAGTAATA 7260 

CCAACCACTG AATAATCTAA ATCGATCTTT ACCAAAATAT TGTTGTAATA ATTTTTTATC 7320

TAATAATTGT CCTACGACAA GCGTAGTACC TAATGCGCCT AATGGCATCA ATACATTCGA 73 80 

AACGATGAAA TCCATATTAT CAAAAATCGT TCCCGCACCG AATCTTACAT CTTTTAAGAT 74 4 0 

ACCAAAAGAT AAGGTTGCTG GAATACTAAT GATAAATACT AAAATACTAC CGATCACTGC 7 50 0 

GACTTTTTTA CGTTTTGTAT TGTCATTCTT CGTGAAGTTA GAAACATTTA ATTCTAATAA 7 56 0 

AGAAATAGAT GACGTTAAAG CCGCAAATAA GAACAGCACT AAGAATCCCA AATAGAATAA 762 0 

TGTGCCTAGA TGCATTTGAC TAAAGACCAT TGGCAGTACT TTAAATAATA ATCCAGGCCC 76 8 0 

TTCT^GTGGT TCATAGCCAA AACTATGTAA AGCCGGAAAT ATAGCTAGAC CTGCCAATAC 774 0 

AGATACAAAG ATATTCATAA CAACGATAGA AATAGCTGAT GACTTAATCG TCATGTCTTT 780 0 

AGAGGCATAA CTCGCATAAG TAATCATACC TGTAGTTCCT AATGATAACG TAAAGAATGA 78 6 0 

TTGACCTAGC GCAAACAAGA TGCCATCAGC AGTAATCTCT GATACTCTTG GTTGTAAAAT 7920

AAATTTCACA CCTTCTAAGA CGCCATCTAA TGTTAAAGAC TTAATCACAA TGACGATTAA 7980

AAAGACAAAC AGCAATGGCA TCATAACTTT CGATGCCTTT TCTAATCCTT TTTCAACACC 8 04 0 

TAACATGACA ATAATCATCG TAGCGAATAT GAATATACCT TGCCCTAGAA CGGTTAACCA 8100 

AGGATTTGAT ATTACCGCTT CAAAATTCAT TTCTTGGAGA TGATTGATGC GTTGAAATAT 8160 

AACTAATTGC CATAATACTT GTCCGATGTA AATGACAATC CAACCACCGA TAACACTATA 8220
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TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CGTCCCATTT TCCCAACAGT 834 0 

GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 84 0 0 

AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 84 60 

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520 

TCTTTTCATC ATTCAC 8 536 



(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

GCTTTGGCCA TTTTATGTGG CGATTGAGAC AATCtGTkGT TGTCTTATTT GATGTTGTAT 6 0 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CATTAACAAA 120 

TAATTCAATA CCTGCTATGT CTAATACTTT AGCACGTGCA TCATCAACAA GACGGCGTTC 180 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 24 0 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AATATCCAAA 3 00 

CAATGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 3 60 

ATCAAAATTC CACGTTTCAT CCATTGAAGA CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420 

AGTGATGACA TGATAGTTAT CAGTAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 480

GCTAAGAAAT CCATGCAGAA ATACTAAAAC TTGATTGGTC TCAACGTTTG CTTCATAAAA 54 0 

TTT^TAATGT GTCATGAATC ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 600 

AGTTATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTTCAGAT AACAATGTGG 660

CATTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAGCTG 720 

TATACTCGAA ATCCAATCCC GTCGGTGTGC CAAACAACCG TTCAAAATAG TCAGTTGCAC 7 80 

TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 84 0 

TCATCTGAAT ATTATTTAAT TTTGACATTA ATAGTCCATT CATATCATGA TAAAATGATA 900 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 96 0 

TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 102 0 

AGTTATCTAC ATCTCTGATA GGCATACTAT TACTAATAAA TAATGCATCT TTTTCAGATG 10 8 0 
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TAATTTCTTT ACGCCCTTTT TTCTCTAAGC ATTGCCATTT TTCTAACCAA CTTACGCGAT 12 0 0 

TAACTGTCGT GTCTTCCATT AATGACCTAA AGAAATCATT CGCAGAAATC TCATATGAAA 12 6 0 

TATCTGGCGC TATCGGAAAG ACATCAATCT TATCATTGTT TTGCACTAAT ATTTGAAATG 1320

CATCAGTTTT CTTTAACCAT TGATTTAACT TTTTAGAAAT CACTGGTTTC CCAACACGAA 13 80 

TTACGAAATC CACATTTAAG TCTAAGCCGC TTCTAAACAG CAAATCATAT GTACAGATAA 144 0 

CATTCGGATG ATCAAATTTT CTTAAATGAC TTAAAGGATC AGCTAAAATA GGCAAATCAT 150 0 

ATATCGTTGA ATACGTTAGT ATTTGaTCAA CTTCTTGGTG CTGCATATCC CCTACAATAA 156 0 

TTAAACCTTT TTTCTTATTT AAAATGTGTC TTAATGCCGA TGCATCTATA CTTTTTTGAT 1620

AGTGCGGTAA AATCTTCATC TCAGAAGTTA ACAATTCTGT TGCATTCAAA TCAGGTGTTA 1680

ACGGATCTCT AAATGGCAAG TTAAAATGAA TTGGCCCTTT ATGTGGTCCA TATAAATATT 174 0 

GACTAGCAAT TTGCATTTGA TAGTAAATTG CATCAATGGT CTCTTTACTA TCATCCGCAA 1800

TAGGCATATC GAACTCATAA CTTACATAAT TATTAAACAT ATTTACTTGA TTAATCGCTT 1860

GTGGTGCGCC TACACTTCTT AATTCATGCG GACGGTCACT TGTTAAAACG ATTAAAGGAA 192 0 

TTCTACTAAT TTGGCTTTCA GCAATTGCAG GCGTATAATT CGCTGCTGCT GTACCTGACG 1980

TACATAATAT AGCGACAGGT CTTTCACTGC CTTTAATTAA CCCAACTGCA AAAAACGCTG 2 04 0 

CACTTCGCTC ATCGGGGTGT ATCCATGTTT TAATATTTGG ATGTGCTTCA AATGCAAGTG 2100 

CAAGTGGCGT TGAGCGTGAT CCCGGACTGA TAACTACTTC CCTTACGCCG TACGCATATA 2160 

ACTCAGATGC AAATGTAAAA ACTTGCTTCG TTAAAGCTGC TTTATGATTT CCCATTCATA 2220 

TCGACTCCTA ATGCATTCAT CATAGGTGTG AACTTAAGGT TCGTTTCTGC CAATTCACTA 2280

TCTGGATCAG AATCTTTAAC AATGCCACAC CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 234 0 

ATAAGCATCG AACGAATTGC AACAATAAAT TCACAATCAT CGTATATATC TATATAGCCA 2400

ACCGGTGCAC CATATAATCC TCGCGTACCA AATTCTTTCT GCTCAATAAA ATCCATTGCA 2460

AATTCTTTTG GATAGCCACC TAAAGCAGGT GTTGGATGTA AATTATCAAT TAAACTAATA 2 52 0 

TACGAATCAT CCTTCAGTGG CGCCTTTATT TCAGTGTACA AGTGATATAA ATGATCATTT 2580 

TTTAGAATTT TAGGCGTCTT ATCATAATGT AATTCAGTGA TATAAGGTTT AATATCATGT 2640

AAAATACTGT CAACAACAAA TCGATGTTCG ATTAAGTTTT TATTATCTTT TAAAAATGCT 2700 

TCAACATTTT TTGTATCTTC GTCCTCATCT TGTGAACGTT TAATTGTACC TGCTACAGCT 276 0 

TTAGTCGATA GTATTTTATT ATTGACCTTT ATTAATTGTT CAGGTGTTTG TGAAAAGAAT 2 82 0 

ATAGAATCTT GTGATTCTAA CAAGAATATA TAACTGTTTT TTTCTTTAGA ATATGCTTGC 2 880 
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ACAATTTTTT CTTCATTATT AATAGATTCT ATAGCTTCTA CTACAAGTTG ACGCCAGTCA 3 00O 

TCTTTATAAA TATCTTCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 3 06 0 

GATATATTGT TGAACAAATC CATTAAATCG TTCAATGCCT CAACAGTAAA ACTTTCCCTT 312 0 

TTAACTGTAT AAGTTAAAAA TGTCCCATTA TTATCAGTTG AAATTAAAAC TTCAGGTAAT 3180 

ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTG ATTTATGACT TGAAAATTGG 3 24 0 

AACCCTCCAA CAACTCGAAG ATGATGTTTC TCAGATTGCG GATGTATAAA TGTGATGTTA 3300 

TGTTTTAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTTTGAAT 336 0 

AATTGAATTG CTTTGTAGCC AAAATATGAC GTTCGATTAT CATTCAAACG CATATAAAAG 34 2 0 

CGATCTCCTG CCTCATTGTC AGTGAGATGA AATAATGTGC TCGGGTCTAG TGACTGTGAT 34 8 0 

AATTTCACTT CAACTGAAAC CCATTCCTTT GAGCTGCCAT ATATCTCTTT GACAATATCG 3 54 0 

TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTTTT TCACTCATTA 3600

TTATATTGTA TCATTTTTGG ATAATTGTGT TACAAGAATT GCTTAAACTT ATCTTGCAAT 3 66 0 

TTTTCACGTC AATTGACCTT TATGCTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 3 72 0 

TGATTTAAAG AGGTTTTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 3 7 80 

AAATATTGGC ATTTAATGCG TCCTCATACA TTAACTGCTT CCGTAGTACC CGTTTTAGTT 3840

GGTACAGCAG CATCTAAAAT ATATTTTCTT GGTAGCGAAG ATCATATTAA AATCAGCCTA 3900

TTCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3 96 0 

TACTATGATT ATAAAAAAGG CCTCGATGAT CATGAATCTG TAGGCATTGG TGGTGCCATT 4 02 0 

GTTCGCAACG GTATGAGCCC AGAGCTTGTG CTACGATTAG CCATTGCATT TTACATCTTA 4080

GCAGCAATAT TAGGTTTGTT TTTAGCTGCT AACTCTTCAT TTTGGTTATT ACCAGTTGGA 414 0 

TTAGTATGTA TGGCTGTTGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 4 2 00 

CCTTTCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT TTATTATCGT TATTGCATTC 426 0 

TTTATTCAAA CTGGCAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 4320

ACTATCGG 43 28 
(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GTTCAATACA GAAAAAATAA ATTTAGATGT TGAAGCATCC TACAATTAAT ACAGATCCAT 60 

TTCAATATAT TTAAACTAAA ATCTCGGGAT TTCTAAATTT TGAAATTTCG AGGTTTTnAT 120 

ATTTTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG 180 

CCAAATCATG GCAAAACATC CATTCGAACA ATTTAATCTA GAATCTAGTT TAATTGACGC 24 0 

TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 300 

ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT ACAGGGAAAT CTCATGCATT 360 

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAACCACAAG CAATCGTAGT 420 

TGCACCAACA AGAGAACTTG CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT 480 

TAAAGCTGGT GTTTCAGTTA AAGTTTTTAT TGGTGGTACA GATATAGAGA AAGATAGACA 54 0 

ACGTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 600 

TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTGATG AAGCGGATCT 660

TATGATTGAC TTAGGATTAA TTGAAGATGT AGATTACATT GCTGCAAGAT TGGAAGATAA 720 

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG TTACAACCAT TTTTAAATAA 780 

ATATTTAAGT CATCCAGAAT ATGTAGCTGT CGACAGTAAA AAACAAAATA AAAAGAACAT 84 0 

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900 

TGATATACTA AATCCATACT TATGTATTAT TTTCTGTAAT AGTAGAGATA ATGCAAATGA 960 

TTTAGCACGT TCACTAAATG AAGCTGGTAT TAAAGTTGGT ATGATTCATG GTGGCTTAAC 1020 

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 1080 

TGCCaGCGAT TTAGCATCTC GTGGTATTGA TATTGAAGGT GTTAGTCrTG TCATCaATTT 114 0 

TGATGTGCCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 1200 

TTATTrAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GAACACAATA TTTCATTAAT 1260 

AGAAGATCGC GGTTTTGTAT TCAATACTGT TGATATTAAA GATGGTGAGT TAAAAGAAGT 1320

TAAAGCGCAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT GACCATTTAA CTAATCAAGT 1380

GAAGAACAAA GTTCGAAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 144 0 

TAAACAAGAA 1450

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1139 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 
AGTCAGGTAT ATCATGCCaT yCTGAATTGG TCGATATTAA TATCAGTGGT GTTAAAGAAC 6 0 

GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA TTAAATGCaG AATAGAGAGG 120

AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT AGATATGGAT GACACATTGC 180 

TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA TTTAACAGCA ATTCAAGATG 24 0 

AAGGTTATTA TGTTGTTCTA GCATCTGGTA GACCTACTGA AGGTATGATT CCAACTGCTA 3 00 

GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG TTATAACGGT AGTAAAACGA 3 60 

TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT TGGTAAGCAA GATTTCGATG 4 20 

AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT TACATATCAT GATGGTCAAA 4 80 

TTATTTaCGA CAGCGAACAT GAGTATATGA ATATTGAAGC AGAATTAACA GGTTTACCGA 54 0 

TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCGA TGTACCCAAG GTCATGGGTG 600

TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT GAATGGTGTG TTCAATGATA 660 

ATGTAGATGC TACGACAAGT AAGCCATTCT TCTTAGAATT TATGGCCAAA GACGTTTCAA 720 

AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA TTCGGTGGAT CAAGTCATTG 780

CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAAGT CGCAGGTCTA GCTATTGCTA 84 0 

TGGGGaATGC ATCAGATGAA CTTAAGCAAT ATGCAAATGA AGTTACGTTG GATCATAATG 900 

AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT TTTAAAATAA GCCTTAACAC 960 

ATGATATTTG AATAAGATAT CTTGTGGTTA AGGCTTTTTA TTTTTGTGAA AATGACTTCA 1020 

GTTATACTAT GGAGGATTTG AAATACATAT TTTAGATTAG TAATGATATC AAACGAATAG 1080 

AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG TCACGGAAGG nAGATAAAT 1139 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 
TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 6 0 

GCTGTGATAG GAAGTAAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 
AtGCCAAACG ACTTTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 180 
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w 




AATGAAGTTG TTAAAATAGA TTTATATACT TCAGTGCCTA ACAAACAAAA TATACAAGAT 3 00 

GTTAAAGAAA TGATAGAACA TCAACAAATC GATGCATTAA CATTTTCAAG TTCGTCGGCA 3 60 

GTACGTTATT ATTTTAATGA AGGATTTGTA CCAAAATTCA AGTCGTATTT TGCTATTGGA 4 20 

GAACAAACAG CACGGACCAT TAAATCATAT CAACAACCAG TAACAATTGC AGAAATTCAA 4 80 

ACACTCGAAT CACTAATTGA AAAGATTTTA GAAAGTAGGG GCTAAAAATG AAATTTGATA 54 0 

GACATAGAAG ATTGAGATCA TCAGCGACAA TGAGAGATAT GGTTAGAGAG AATCATGTAA 600 
GAAAAGAAGA TTTAATATAT CCAATTTTTG TAGTTGAAAA AGACGATGTG AAAAAAGAAA 66 0 

TTAAGTCATT GCCAGGTGTA TACCAAATCA GTTTGAATTT ACTTGAAAGT GAATTAAAAG 72 0 

AAGCTTATGA CTTAGGCATA CGTGCCATTA TGTTTTTCGG TGTTCCAAAC TCAAAaGATG 78 0 

ATATAGGTAC TGGTGCATAC ATTCACGATG GTGTTATTCA ACAGGCAACA CGTATTGCTA 84 0 

AAAAAATGTA TGATGACTTA TTAATTGTTG CAGACACTTG TTTATGTGAA TATACTGATC 900 
ATGGTCATTG TGGCGTGATT GATGACCATA CACATGACGT TGACAATGAT AAATCATTGC 960 

CACTACTTGT TAAAACAGCA ATTTCTCAAG TGGAAGCTGG TGCTGATATT ATTGCGCCAA 102 0 

GTAATATGAT GGATGGTTTT GTTGCTGAAA TTCGTCGTGG ATTAGATGAA GCCGGCTATT 1080 

ACAATATTCC TATAATGAGT TATGGTGTCA AGTATGCATC AAGTTTCTTT GGACCTTTTA 1140

GAGATGCAGC AGATTCAGCG CCATCATTTG GGGATAGAAA AACGTATCAG ATGGACCCTG 12 00 

CTAACCGTTT GGAAGCACTT CGTGAATTAG AAAGTGATCT TAAAGAAGGG TGCGACATGA 1260

TGATTGTTAA ACCTGCTCTA AGTTATTTAG ATATAGTTCG AGATGTTAAA AATCATACGA 1320

ATGTTCCAGT TGTTGCATAT AATGTGAGTG GAGAATATAG TATGACTAAA GCAGCGGCAC 13 80 

AAAATGGTTG GATAGATGAA GAACGTGTCG TTATGGAACA AATGGTTTCA ATGAAACGTG 14 4 0 

CAGGTGCTGA TATGATTATT ACGTATTTTG CAAAGGACAT TTGTCGCTAT TTAGATAAAT 150 0 

AAGGTTTTAT ATTTATGATT TTCCATAAAC TGTAGGAGGA ATTTACTTTA TGAGATATAC 156 0 

GAAATCAGAA GAAGCAATGa AGGTTGCTGA AACTTTAATG CCTGGTGGTG TAAATAGTCC 1620 

AGTACGCGCA TTTAAATCAG TAGATACACC AGCAATTTTT ATGGATCACG GTAAAGGTTC 16 80 

45 AAAAATTTAT GATATCGATG GTAACGAGTA TATCGACTAT GTACTAAGTT GGGGACCACT 174 0 

TATTTTAGGA CATAGAGACC CTCAAGTTAT TAGTCATTTA CATGAAGCAA TTGATAAAGG 18 00 

TACAAGTTTT GGTGCATCAA CATTACTTGA AAATAAATTG GCGCAgcTCG TTATTGACCG 186 0 

50 AGTACCTTCA ATAGAAAAAG TGCGTATGGT GTCATCTGGT ACAGAAGCTA CATTGG AT A C 1920 

TTTAAGATTA G CACGTGG TT ATACTGGCAG AAATAAAATT GTGAAATTTG AAGGTTGCTA 19 80 
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GCCGGATTCT CCTGGTGTGC CTGAAGGTAT TGCTAAAAAT ACAATTACAG TTCCATACAA 2100 

TGATTTAGAT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AACGATATTG CTGGTGTAAT 2160 

CGTAGAACCT GTTGCTGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 2220

ATTAAGAGAT ATTACGACTG AATACGGCGC ATTGCTAATT TTCGATGAAG TAATGACTGG 2280

TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 2340

CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTGCA TTTGGTGGTA AAAAAGAAAT 2400

CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAAGCG GGTACGTTAT CAGGAAATCC 2460

TCTTGCAATG ACAAGTGGTT ATGAAACGTT AAGCCAATTA ACGCCAGAGA CATATGAGTA 2520

TTTTAATATG TTAGGCGATA TACTTGAAGA CGGTTTAAAA CGTGTATTTG CTAAACACAA 2580

TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA ATGAAGGACC 2640

TGTAACTAAT TTTGAACAAG CGAATAAAAG TGATTTGAAA TTATTTGCAG AAATGTATCG 2700

AGAAATGGCA AAAGAAGGTG TGTTTTTACC ACCATCTCAA TTTGAAGGTA CATTCTTATC 2760

TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTTTAAG 2820

TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 2880

TAAATGTATA TAAGGCATGG GCGGCGACTT GATAGTGAAA GTCCACTACT A 2931
(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:

AATTATGAAT GCATTACCAG TATTATTACA AAAGAACAAT TAAAAATGTT TGTTTATGAT 60

TATGATACGC ATCTCATTAA AAATGTAaTG GTTGCAGCAG ACGTGTTAAA GGCAAATGAT 120

ATTCAAGGAC ATGAACCATT AATCGTTAAC CTTCAAACGA TTGATGAAAC ATTACATCGT 180

TTACCTATGC ATAATAGAAA AGACATGATG GTTAATGGCG GTGTACTTAT GGCACATTTA 240

AATGCCAAAA GTGGTCCGTG GTTAAAAGAT GTGCTAAGAC AAATTGAGAT AGCGATTGTA 300

ACAGGTAAAG TAAGCAACGA AGAAACTGAA ATTTTGAAAT GGGTGGATAA TCATGTCAAA 360

ATATAGTCAA GATGTACTTC AATTACTCTA TAAAAATAAA CCGAATTATA TATCTGGACA 420

AAGCATTGCG GAATCACTTA ATATTTCACG CACTGCAGTA AAAAAAGTGA TTGACCAATT 480
CCCAGATATT TGGTATCAAG GTATAATAGA CCAATATACA AAAAGTTCTG CTTTGTTTGA 600

TTTTAGTGAA GTATACGATT CAATAGATTC TACACAACTT GCTGCGAAAA AGTCACTTGT 660 

TGGAAATCAA TCTTCATTTT TTATCTTGAG TGATGAACAA ACGAAAGGTC GTGGGCGATT 720

TAATAGACAT TGGAGTTCTT CAAAAGGGCA AGGACTTTGG ATGTCTGTCG TGTTAAGACC 780

TAACGTTGCA TTCTCAATGA TATCTAAATT TAATTTATTT ATTGCATTAG GGATAAGAGA 840

TGCGATTCAA CATTTTAGTC AAGATGAAGT CAAAGTGAAA TGGCCGAATG ATATATTTAT 900

TGATAATGGT AAAGTGTGTG GTTTCTTAAC TGAAATGGTT GCTAATAATG ATGGTATAGA 960

AGCAATAATA TGTGGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATGAAAG 1020

TATTAGACAT AGAGCAACAA GTATACAATT ACATGATAAA AATAAATTAG ATAGATATCA 1080 

ATTTTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATTTT TAACGTTACC 1140 

TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCTAAT ATTTGGAATA GAACGTTGCt 1200 

ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 1260 

TCTAATTGTT AGAGATGAAG CGGGTGAATC ACACCGTTTA ATTAGTGCAG ATATAGATTT 1320 

TTAACACTAA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATTTG 1380

GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 14 21 
(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2202 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

CCAAGTTGCC TAAAATGATT AAGCAAGGTT TATACCCTAT GCnAACGATT GAACAAGAAT 60 

CTGGAGCCAT CCGACTGCCA ACGATTTCTA GAGTGAnGCG TTCATTACAA TGGGGTAATG 120 

ATGCTTATAC AATGATTTTA GATCGTATGA ATATTGAAAC AAATGAATAA TAAATGAACG 180

ATAAACAATG GTTATCTATC TGCACTAATA AGGTAGATAA TCATTGTTTT TTCACGAAAA 240

AATTTACAGA GTAAAAGAAC TTAAATTTCA TATTAAGTCT TTAGAACTCG ACACTTAAAA 300

ATGCTATAAT CATATGTATG TTAAAAAAGG AGTTTCGGAA AATGTATGAC ATTAAAAAAT 360

GGCGCCATAT TTTTAAATTA GACCCAGCTA AACATATTTC AGATGATGAT TTAGATGCGA 420

TTTGTATGTC TCAAACAGAT GCAATTATGA TTGGTGGAaC TGATGACGTT ACTGAAGATA 480
CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 600

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 660 

GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 780 

AAATGGTCAA TCATATGTAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 840

GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 900

GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATTGCA GATACAATTA 960

TCGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA GGACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 1140 

AACACATAGA ATTGCTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 1200

TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 1260

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 1320

TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 1380

AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 1440

TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 1500

TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 1560

ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 1620

CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 1680

TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG 1740

TAAGTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 1800

TGCTGATATT CAAAATATCT TATCATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 1860

TTTAGAGCAA AATTATCGTT CGACGAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 1920

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 1980

TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 2040

AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 2100

CAATCACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 2160

GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202 
(2) INFORMATION FOR SEQ ID NO: 285:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 
AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG CAAGATGATG GCACGATAAA 60 

ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC TCATATGTTG 120 

AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA 180 

ATTAATACCT CGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT 240

GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA 300

CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCATA ACTGGGAACC ACCAACCTTG 360

CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT 420

CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA 480

AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA 540

CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA 600 

CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT 660 

AATAGTTGAA AATGTACTAC CGACAGAGCT GGcAAGGAAT ATGGCACCAA CTGGAAACGA 720 

TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA TGCTACAGCT 780 

TCTTC 785 
(2) INFORMATION FOR SEQ ID NO: 286:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 60 

AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG TACAAGTAAT 120 

TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 180 

ATAATTGTTG TTAACGGTGT ACGTATGTCA TGCGAAATTG ATCTTAAAAA ATTTGAATGT 240

GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 300
ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 420

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 480

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660

TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 
CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 812 

(2) INFORMATION FOR SEQ ID NO: 287:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:

ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 60

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 120

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 180

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 240

GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 300

AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 360

AAATTCAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 420

CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TACATTAAGT 480

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCGCTTA 540

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600

AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAGCATTAA 720

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 780

ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 840

AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATTGAAAA TCTTGGGCTA GCAATGGAAC 900
CATTCCAATT TGCATATTTA AGTTTAATGC aGGAAGAAAA GATAcAAGCA AATCATCAAA 1020

TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT TGAGCGTTTT ATGAACAACC 1080 

AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCGG TCATTTAAGT GCTACTGTAA 1140

AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAt CCAGTTTTAT 1200

CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTCAAG 1260

ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT TATTGGTGAT TTTCCAGTAG 1320

GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 1380

GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT GCTGGATATG 1440

CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA TGTAAAACAG CTTGAAAAAT 1500

ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCAACT TTATTTAAAA 1560

ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 1620

CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTCCAAG 1680

GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 1732

(2) INFORMATION FOR SEQ ID NO: 288:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2779 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:

AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTTCAAATGA 60 

AAAAATAAAT AGTATTCGAA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 120 

GAATCAAATT AACGAAATTG TGCTTGAAAC AATTAGAGAT ATTAATAATG CGCATACATT 180

ACAGCAAGTT GAGGCTGCAT TGAACAATGG TATTGCTCGA ATTTCAGCAG TACAAATTGT 240 

AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GCCATTTAAC 300 

AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACTATTGGAC ATAAAAAGAA 360 

ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 420 

TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAG CTAGTTTCTG 480 

GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG AAATAAGAGA 540

TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC CACTTTTATT 600
AAATAATGGC GAGTCACTCG ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG 720

TCGTAAAGAA GATGAAGAAG ATGTGGAAGT TACAAATGAA AACACAGATG AAAAAGTGTT 780

GAAAGATAAC GAACATTCAC CACTCTTATT CGCAAAACGA CGCAAAGATA AAGAGGAAGA 840

TGTTGAAACA ACAACTAGTA TTGAATCTAA AGATGAGGAC GTTCCTTTAT TATTGGCTAA 900

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAAATACTTC 960 

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 1020 

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 1080 

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 1140 

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200 

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATCTGCAATT 1260 

AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 1320

GCATAAATAC ATGCAGCCCA TTCTGCAGTG TGGTTATCCA TTTCGCCTAA CTCATGTGTA 1380

TATGTATAAT GCTGCTCATC TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 1440

cCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACTTTC TATTCAATAC 1500

TTAGTTAAAG TTACTATTAC TGTAATACAA AATATGTTGG GTAATCCATT AAAAAACACG 1560

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTCGCTG ATTCAAAGAT GATTTTCTAA 1620

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 1680

TTCTATGCTC TACGATGTTA TATTGGCAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 1740

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 1800

GGCAATGCGA GTTGGGGTGT GGGTCCCAAC ACAGAAGATG ACGAAAAGTC AGCTTACAAT 1860 

AATCTGCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT TTCTACAGAC 1920 

AATGCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAA AATATTATTT CTGTCCCACT 1980 

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGTTAATA GCCTTGATCT AACTTATCAA 2040

TCTTACCTTT ACGATAAAAT GATTTAGCAA TATATCCAAA TGGTACATTG AAAACTGTTG 2100 

AAGCTAATTT TAATACGTAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CCAGGTAAAC 2160

TTCCGATAAA TGCGATAGCT ACAAATAAAG CTGTATCAAT TATTGrGCTT AAAAATGTAC 2220 

TACCATATGg CACGGATGAA AAACGTTTTa TCmGaACTAA ATACTTTTTT AATTAGTGAA 2280 

AAGATAAATA CATCAATATG TTGACCAATA ATATATGCGA CGATTGAGCC TAAAGCAATG 2340

CGTGGCACAA CATCAAAGAT TGCGTGTAAT GCTTTTTGTG CCATATCTTC TGGTGCAGGA 2400
CAAACTGCTC TTTTTGCAAC TCTACGCCCA TAAATATCGT TTAATATATC TGTTGCTAAA 2520

TAAATAGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAAGAT TTCTACAGTT 2580

TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 2640

TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTGCAATAAA CGTAACTAGT 2700

CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 2760

GAAACCGCTC AATAAATAA 2779 

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1999 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

ACTGATGTGC GTTCATCAAA AACAATATAA TCAAATTCAT TTTCATCAAA TTGCTTAAAA 60 

TTATCATCTC TAGATAATGT TTGAATGGTT GCAAATAAAT ATTTGGCATC GACATCTCTA 120

TGTTTTCCTG TCAACAATCC AAAATCACTA TCATTTTTTA TAGGTAATAC TTTTTTAAAT 180 

TCTTCCTTAG CTCTATTTAA AATCCCCTCA TTATGAACAA TAAATAAAAA TTTATTAGGG 240

TTTACTTCTC TAACATCTAA TGCACATAAA ATCGTTTTAC CTGTACCAGT TGCAGATATT 300

ATTAACGCCT TATCTTTGGC TTTATCCCTA ATAGCTTTTA ATGACCTTAA TGCTTCTGCT 360

TGCATTAAAT TGGGTACAAT TTCCACTGAT TTTTTCACCT TATCAGCTAG CAGCATTTGA 420

GTTTGTTCAA CCTCCGCTAA TTTTTCTAAG GAACGGTACT CAAATGATTC TTTATATGAA 480

TTAATCCATT GCTCAGTCAG TGGGGTACTC TTTTGCCATA ACAAGTCAAA TTCACTTTTT 540

ACACTATCAA CTAAATCGCC ATTTTTCATA GTAGACAGTA AAACATTATG CTCATAATTA 600

ACCTTTAACG CATTAGATGT TAAATTAGAG CTTCCTATTA CCATAGAACT ATAATCCTTA 660

TGCTCAAAAA TATATCCTTT GGCATGGAAT CCAGCAATAT CAGTTAATCT TACCTCTACA 720

TTTTTTAATT TAAGTAATTC TCCATACATT TTAGGACTAT TAAACCCTAA GTAATTAGAT 780

GTTAATATTT TCCCTTTAAC ACCCTTATTG CTTAAATCTA ATAGTTGAGC CTTTAAGCTG 840

GcTAAACCGC TTTCTGTTAT AAAAGCCACA GAAAAATAAA ACGTTTCACA TTTTTGAAGT 900

TCATCTATAA TTGTTGAAAG AACTTTTTCA TTTTTATTAT TTACTAAAAG CTTCGGTGTA 960

TAATTCCCTT TATGAGAAAT ATGTTTGTCT ATAAACCCTT TATGTAAAGA TTGATTGAAA 1020

EP 0 786 519 A2 

CGCTGGGGCC CAATTTAATT TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 1140

TTCAGTTAGA GTTGGTAACT CTTTGTTTAA AGTACATTTG TATGTTGTTA ACCTAACAAT 1200

TCCAAAATCA TATTCATGTT CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 1260

CATTTCTTCT CTAATTTCTC TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 1320

ACnGCCAGGA AATTCCCACA TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380

AATTTTGTTA TCAGAAAAAA TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 1440

CACCCTTCAA TTTAAAATCA TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500

AAATCAATAT TAAAAATTAA TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560

ATACACATCT ATTATATAAT ACTTGTGAAA AGTATTGTCT TGGGGCTGTG TTTTTTTACT 1620

TTTGGGGCGT ATTTCTTTAT AATTCATTAC ATAAATGTAA GGGCTTTAGT TTTCATGTTT 1680

TATTAAGTCT AACTGAGATT TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 1740

TAGAACGAGC TAATGATAGT TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800

AGCGAGAAGG TTTTGAAAGT AAATTAACAT TTGGTACTGC GGGTATACGC GGAAAATTCG 1860

GTCTTGGTGA AGGTCGACTT AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 1920

GTTATTTAAA TGCCCAAACA AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 1980

TTTCcAACTG AATTCGCCC 1999 
(2) INFORMATION FOR SEQ ID NO: 290:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GATGACTTTT CCCCCTCATA ATCTTCATGG TCCAGGCGTC CATTAATGCG TCAAAGGATG 60 

GCACATTTTA CCTGGAACAA ATGATTCATA TGGTTCATAA AAATCACGCG TCGTAATATA 120 

ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 180 

TAAAGATGAA TAGTCACTAA TTAATAAATC TGTTATGAAC AGTATATCAT TAACTTCTCT 240

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GCAATATTAA GTCTATTTTT 300 

CACAAATGGA TGCATTTTAA ATAATACAAC CGCGTTATTT TTTTCGCAAT ATCTTGCTAA 360 

ACGTTCAAAA TCAATTTTGA AAAATGGGTA ATGTGCTGTA CCATGACCAC TACCTCTAAA 420
TTGTTTGATC TGTGTCGCAT AAGCTTCATC AAATAGTACA TCAGTACGTG GAACACCTGT 540

AGGCACTACA TTTTTCTCTT TAATACCAAA TGCTTCAGCG TAGAATGGAA TATCGGTTTC 600

AGATGATACA TAAGCTTTTG TATAGCTACG ATGATTTAAT GAATCAATAA ATGGTCCACC 660

CTTTTTACCA GTACGACTAA AGCCAACTGT TTTAAAGGCA CCAACGGCAT GCCATACTTG 720

AATAACTTCT TGAGAACGTC TAAAACGCAC TGTATAAATC AATGGGTGAA AGTCATCAAC 780

AAAGATGTAG TCTGCCTTCC CAAGTAAATA TGGCAATCTA AACTTGTCGA TGATGCCACG 840

TCTATCTGTA ATATTCGCTT TAAAAACAGT GTGAATATCA TACTTTTTAT CTAAATTTTG 900 

ACGTAACATT TCGTTATAGA TGTATTCAAA GTTTCCAGAC ATCGTTGGTC TAGAGTCTGA 960 

TGTGAACAAC ACCGTATTCC CTTTTTTCAA GTGGAAAAAT TTCGTCGTAT TAAATATCGC 1020 

TTTAAAAATA AATTGTCTTG TATTAAATGA TTGTTTGCGG AAATACTTAC GTAATTCTTT 1080

ATATTTACGA ACGATATAAA TACTTTTAAC TTCCGGAGTC GTTACAACAA CATCAAGGAC 1140

AAATTCATTA ACATCGCTAG AAATTTCAGG TGTAACAGTA TAAACCGTTT TCTTCGAAAT 1200

GCCGCCTTTT CTAAATTCTT TTAGGTAAGT CTGCAATAAG AAATTGATTT TACCATTTTG 1260

TGTTTCTAAT TCGTTGTATT CTTCTTCTTG TTCTGGCTTT AGATTTTGAT ATGCATCATT 1320

AATCACATCT GGGTTTAACT GTGCAATATA ATCAAGTTCT TGCTCATTCA CTAATAAGTA 1380

CTTATCTTCA GGTAAGTAAT AACCATTATC TAAGATAGCT ACATTGAAAC GACAAACGAA 1440 

TTGATTCCCA TCTATTTTGA CATCATTCGC CTTCATTGTA CGTGTCTCAG TTAAATTTCT 1500 

TAATACAAAA TTACTATCTT CTAAATCTAG GTTTTCACTA TGTCCTTCAA CGAATAACTG 1560 

AACACGTTCC CAATAGATTT TATCTATATA TATCTTACTT TTAACCAACG TTAATTCATC 1620 

CTTTTCTATT TACATAATCC ATTTTAATAC TGTTTTACCC CAAGATGTAG ACAGGTCTGC 1680

TTCAAAAGCT TCTGTAAGAT CATTAATTGT TGcAATTTCA AATTCTTGAC CTTTTAACAA 1740 

CGCTAATTTA nCTACAATAT CTGGGTATTG AATGTATAAG TCTACAACAT CTTGGAAATC 1800 

TTTTGAACCA CTTCGACTAC TACCAATCAA CGTTAGTCCT TTTTCCAATA CTAGACGTGT 1860

ATTAACTTCT ACTGGGAACT CACTTACACC TAACAGTnCA ATGCTTCCTT CTGGTGAAAT 1920

GTAATCGATC ATT 1933
(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:

nGTnCGGnCA GATATATTGG TGGTCTTTAG TAAGTGTATC AAATTCATCA GATGTCAAGG 60

GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 120

GAACACAaGA TCAGAAGGAT TGCGTATAGG CGTCCCACAA GTTTCTAGCA AAGCTGATGC 180

TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 240

TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 300

ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 360

GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTTGT 420

GAAAAAGTTA TACGCAACAC CTACACAAAT CCATCAATTG TTTGGAGTAT GTAGAAGTAC 480

AGTATACAAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 540

TGAITATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 600

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 660

ATTAGCAGTG TTGTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 720

TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTTT ATAAGGAATA 780

CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 840

ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 900

ACATTTTGCA ACATATGACA ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 960

TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 1020

AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 1080

AAAAGAAATT TTAGAAGAAC AATTTGAATT GTTAGCAAGA TTAAATGAAC AAAGTGATGA 1140

CCCTAGAGAA ATAGAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTTTAAGG 1200

AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ATTTTTGAAG AATTAAAACT ATTAAATAAG 1260

AACTTACGTG TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 1320 

GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAGAAGTT 1380 

AAGGAAACTT CTGCTGATTT GACTAAAGAT TATGTTTTAT CAGTAGGAAA AGAGTTCCTT 1440

AAAAAAGCAG ACACTTCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTTGGTGCG 1500 

GATAAGCTAT CTACTATCAA AGAAGAGCAT TATGAAAAAA TTGTTGATTT TATGAATGCG 1560 

AGAATAAATG CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 1620

GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 1680
GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCTTTTCAA AATTATAAGC 1800

GAAATCAATA TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 1860

AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 1920

AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTGATGTC AtTATATTTT 1980

CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 2040

TAGATAATC 2049
(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:
ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC GCCCTAAAAT 60 
TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 120

TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT TTAGATAAAT 180
GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG TCTTGATTGA 240

CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC ACATTAACAT 300

CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA 360

TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGGAA 420

TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT GTAACAGAAT 480

CCTCATTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAATTGCCT 540

GACCGAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 600

CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 660
CAATTTCGAT AACTTTTTTA ATACCTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 720

CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 780

TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT TTACAGTTTG 840
ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGCG TTAAGTTTTC 900

AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 942

(2) INFORMATION FOR SEQ ID NO: 293:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1268 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:

TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 60

GTGAGATTGT GaAATTAATC AAATAATAAT ACGGTGGATA CTTTCCTAAT TTACGATATT 120

CCATTTCCTG ACGATAAAAT GTTAAATAAT CATTTTTTTG AACATCCAAT ATTGAATAAT 180

GATCTGGATT ATACGTTTGA ATGATGACTT GACCTGCCTT TTCATGACGA CCAGCTCTAC 240

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 300

ACATTGTATC TGCATTCAGC ACACCAACTA AAGTAATATT TGGATAATCT AATCCTTTCG 360

CAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTCGAATTCA GTCAATAACT 420

TTTCATGTGC ACCTTTCTTT GAGGTTGTAT CTACATCCAT CCTAATTATG CGCGCATCTT 480

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTACT TGTCGAATGT 540

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT ATCACAGTTT GGACATTGCG 660

GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 720

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 780

ACATTGACCG ATTACCTTCA CTCAATTCTT CACGCATGTC TACTATATCA ATTTCAGGTA 840

AAGCTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900

GTGCATAACT TTCAAGACAT GGTGTTGCAC TTCCTAAAAT GACTGGACAG TGATGATATT 960

CAc'FTCGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG ATAATCTTCT TGTTTATATG 1020

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 1080

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCAT 1140

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 1200

TGAAGCGTAA AACCATTTGC GGTGTTAGAG CGATTTCAGG AACTAACATC ATCGCCTGTT 1260

TTCCTTGG 1268

(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:

TACCACCAAA TAATATATTA GCTGGCATTT TAATAACATT TAnATTTGTC ATGATATCAT 60

CAATAAAATG TTGAAACTTC GTAATTTCAC CTTCATAATC ATCAATTGCT GCTAATTGCG 120

CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 180

AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGGTGC TGACATGACA GCGATACGTT 240

TCAAAGGATA GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 300

CACTGATAGA TTCATAACCT TCATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 360

ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 420

TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATGC TTCACATCCC 480

GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 540

ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCtaTATAcC TCCACTAtGT CTAAAGakGT 600

TkGCTAAACG CGTTGtCGTC GATGATTAA 629
(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 60 
AGA-FAATATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA 120 
TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAGCAT ACACAGACAA 180

AATTATAGAT GCATTTGGTA TGAGACCTTA TATTAAAAAG AAAGTTAAGA AATATTCAAT 240 
GGGGATGAAG CAAAAGTTAG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 300 

CTTGGATGAG CCTACAAATG GkATGGATCC AGATGGCTCa ATTGATGTGC TGACTACAAT 360

TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCtAATA TCAAGTCATA AGTTAGAAGA 420
TATTGAATTA ATTTGTGATA GAGCTGTATT TTTAAGAGAC GGnCATTTTG TTCAAGATGT 480

AAACATGGAG GAAGGTGTTG CATCTGACAC AACGATAGTT ACTGTTGATC ATAAAGACTT 540

TGATAGAACT GAAAAATATC TTGCAGAGCA TTTCCAATTA CAAAATGTCG ACAAAGCAGA 600
ATTAGATATT TATCCGAAAT ATATTGAAAC ACGTAAAAGT TCATTGCGTG ATACGTACTT 720

CAATATAAAT CAAAGAGGTG ATAAATAATG AGAATTTTAA ATTTAGTTAA GTATGATTTT 780

TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 840

GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC CGAAACATAT TATTGTCTAT 900

GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 960

AAGACAATTA CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG TAAAGTTAAA 1020

ACACGCGTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 1080

GCATTAATTC ATTATGTGAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 1140

GCGTTTTCTA AATATGTAGA TAATTTATGG TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200

TTGTTTTTAT TCTTAATCAC ACTTGCATCA CAAAAAACAG CAATGATATT TTCATTAGGT 1260

GTATTTTTAG TACTCATTGT ACCGTTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 1320

GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGATAAAATG 1380

ATTAGCTCTA ACTTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 1440

ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 1500

TGAGGTTGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560

TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 1620 

TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 1680 

TTGCTCGGTA ATTTATCAGC ACGAACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 1740

GGTCCCCAAA CTGTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 1800 

CGTGATTTAC CATTTGTTAA ATATTGATCG ATCAGTTTTG TATCATCATC TCTATGGAAC 1860

ACGCGTACTT CTAAATTTAA TGCTTCACTG ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920

GCATCTCCAC ACCAGTCTTC AGTAATTACT AATACTTTAG AATAATTCAT CTCTTTTATT 1980

TTTTTGATGC GTGAATCATC TTCTGGTAAC TCAAATGATT GATAGATACT GAGAACGGTA 2040

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100

GTTTCTAAAT TTGTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 2160

AGTAATAAAA GGTAATGAAA TAAATTAATT GCAAATTCTT TGTTAATTTT TGTTAAGGAT 2220 

GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 2280 

GTATTCAAAA TCGCTGGGTT GTGTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 2340

TTGGTTTATA TCAATATCGT CATACGAAAA CTGTGGATTT GTCTAATCTT GAAATAAACG 2400
TTGATCGTTT TAAATTTTAT AACAGTAAAG CTCACCCTGA TCTTACCGTT AAAGTGAGAG 2520

AAAAGGATAA CATCGTTAAG GGGATAATAT TAGTAAGAGA TGAAAAGATA CATACTAATT 2580

TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 2640

AAAGrACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 2700

TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 2760

AGCTTTAGGT CTTAAAGTTw TAAAAAACGA ATGAaTAATT TTATTGGGAT GAGTGAC 2817

(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1607 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 
TCTGTTAAAA TGATTTTTCT TTTAnAAAGG CCGnAAATCA ATGTTCGATT nTTATTTGCA 6 0 

25 TTATGGTCTC GATATTGGTA GAATATCAAA TGGTTAAATG AGAAAAACTT GGAGGTGCTC 120 

ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGGAAAA GGAAAAATAA 180 
CGGATTTCTT GGcAGAACAG TCAGATGTTA TCGCGCGTTT TTCAGGTGGT AATAATGCAG 24 0 

30 

GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAATTACA TTTAGTACCA TCTGGTATCT 300 
TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTCGT TGTTGATCCA G TTG C ACT AT 36 0 

TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 420 

35 

ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 4 80 

gTGGtGACAA TAAGATTGGT ACAACTAAAA AAGGTATCGG TC GAG CAT AT GTAGACAAAG 54 0 

TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTGAAAA AGAAACATTC GAAAGATTAT 600 


TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAGG TATGTTTAAC GAAACATGTC 66 0 

CATCATTTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 720 

TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GTACTTTTCG 780

AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCATTC GTTACATCAA 84 0 

GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 900 

AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTTGG TGATGGTCCA TTCCCTACTG 960

AATTATTCGA TGAAGATGGA CATCATATTA GAGAGGTTGG TCGTGAATAC GGTACAACAA 1020 




EP0 786 519 A2 



TAAGTGGTAT TACAGATTTA TCTATTAACT CAATCGATGT TTTAACAGGC CTAGACACAG 114 0 

TGAAAATCTG TACAGCTTAT GAATTAGACG GTAAAGAAAT TACTGAGTAC CCAGCAAACT 1200 

TAGATCAATT AAAACGTTGT AAACCAATCT TTGAAGAGTT ACCAGGTTGG ACAGAAGACG 1260

TAACAAATGT GCGTACTTTA GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 13 20 

TTTCAGAATT ATGTAATGTA CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 13 80 


CAAACCTATT AAAAGAATTG TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 144 0 

ACATGAGCCT TCTATCTTTA TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

CGATTATTTA GATAAAAAAT ACTAACGTAA AGGCGATATT TG CTAGTCAT AATTTAGAAG 1560 


rTTAgATGAt AtTtAACGAA AAtTAAgATG anATACtTGA ATGGTAA 1607 
(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

TTAGAAGTAA GCACTTTAAT ATCTTTACCT ACCCATGTAC CAACACCTTC TTTAGGCTTT 6 0 


GGATTTTCAG CATGGTTATT TGATTTATTC ACCTGTTCAG ATCTATTTAC TTTATTACCT 12 0 

TTATTCGGGT TCTCTAATAC ATCAAATTTT AATCTCGGTG AATAAAAAAG ATATATTAAA 180

AATCCAACTA AAAATAATAC ACCGACGACT CTTATAATTA ACTTTTTCAT CAATCAACCA 240 


CCTAAAAAGT ATTAATACTA TTGTAAAAAA CAACACATTA ATTAGCAAAT TTTCAACACT 3 00 

GACATAACTG TGTCGTTTCG ATAAAACAAA ATCATCTTCA GG CAT AATTT TAACGTCATA 3 60 

AAGATTTTCC CTACACTCTA TATCATAACC TATCTTTATG TTTTCAGGTT G AATTT CATT 420 


ATTAAGATTA AAATATGTAT AAAAAAATGG ACAGTTAAGG TATCAAATGA ATACCATCAA 4 80 

CTGTCCGACT ATTCTTCATC AAAAAACCTG ATAAAACAAA TTGCCTTATC AGATTAGTAT 54 0 

CATTTGTATA AGCATATTAA TGACCAAATG TTGCTTTAAT CAGTGATGTC GTTTCTCCAC 600

CTGGATATAA TACATATAAT AATAAGTAAA CTGCTACACC TGTAATTGCA GTACAGAACC 6 60 

ATATAACTGA AGCGAATGGC CCGAATTTGC GGTGTACATT ATATTTATCT TTAAATGCAG 720 

TAATAATTTG AACTAGGCCT AGAATACCAC CAATTGTTGC TAAATTAATG TGGAAAAATA 780

AGAAAATCGT ATAATATTTC TTAATTGATG CTGGTCCGCC AAATGCTGTA TTACCGATAA 840 
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TAACATTTTT GTGTTTATTT ATTTCCCTTT TCCAAATAAG TCTCCAACCA ATGGCAATTA 36 0 

AAATTGCACT AATGACAATA CATGTCGTAC TAATCGTTGG TAAAATTGGA ACGCCCATAT 102 0 

TTTTCATCCT AACTTAATTA ATCTAGATCA AAGTAAGTAA TGAAACAATC ACAGCTAACA 1080

CGAAAAAGAT CACTAAATAA TTTAGTGAAT AT AT AAA CAT TTGTGTTGCC CATTTTGTTT 114 0 

GATCTGAATT TTTCTTAAAT GTTGTTAAAC CTAATGCAAT CCATCCTAAA TTTAATAAGG 12 00 


TrGCTAACAC TaCGAATACG ACACCTAAAT TTATTAGTAA TAAAGGTACT GGCAATAAAA 1260 

TAATCAACCA GATAAACATA CTGACACGTG TACGTTTAAA GCCCTTAACT GATGGTAACA 1320 

TTGGAATATT TGCAAGTGCA TATTCATCTT TACGTTTAAT AGCTAAGGcA TAAAAATGAA 1380

TTGGtTGCCA ACmAAATACA ACTAAAAACA GCGCAATCGC TGTTAAACTA ATTTGTC CTT 1440 

CAATTGCAAC CCATCCAATT AGTGGTGGTA CTGCTCCAGG AAAACTCCCA ATCACTGTGT 15 00 

TCCATGTTGT ATGTCTTTTA GACCATATTG AGTAATAAGA CACATAACCT ACAATCCCCA 1560

TAAGACCAAG TACGCCTGAT GGTATATTCA ATAAAAACAA ACAAATTTCT CCAACTAACA 16 20 

TCATACCAAA ACTTAATAGT AATAAATTTT GATCTGTAAT TCTATTATTT ACAGTTGGTC 168 0 

TATTTTGTTT ACTAGGCATA ATACGATCAA TATCTTGGTC GTAATAATTA TTTAACGCAC 1740

ATGCGCCACC CATAATTAAA GTAGATCCAA ATAGCATTAA TAAAATTTGA GGTATTGATG 18 00 

ATAAGAAGGA ATGATTTGTC ATTACAACTG CTAGCCATGC GCcCGCAAAA GCCGGAATTA 186 0 


AGTTACCTTG AACAAGTCCC ATTTTAATTA TCTGTTGCAA TTCTTTGAAG TTAACTCTGC 1920 

TAATATTTTG TGACmAAGTA TGCTCTTTGC TCATAATCCC CCTCCTTAAA TTTGTTCATA 1980 

TAAGATTATG AT AT CTT AG A TTGCATAAAA AGACTAGGTT TAATAAAATT AAATTGTGAC 2040 


AAATTAACGA CAAGAGAAAA TGTCAATTTT GTGACACAAA TAACATTTAA TTTATTGCTA 210 0 

TAATGTATAT GTTAGAAAAT TTTAATAAGT AGAATCATGC ATCTAAAAGA GATTAATATT 216 0 

TAAGCTTCAA ATTTGAGTAA ACGTGGATTA CATAATTATC CCAATAAAAA AATCATTACG 2220 


ATTAAGTTCT TTTTATGTCG TCCACATACA ATACTTGTAA AATTAAATCA TATTTCCTGC 2 2 80 

GTTGGATCCC ATCTTTTCAT ATCCTACAAT CAGGTCTATT TATAGTATCA TCTCAAATCC 2340 

GGCTATTAAT TCTAATTCTC AGTGATGCGT TTTTTTATGA TGGGGTGTAT AAATTGTTTG 2400

GCAAAAAGAA TTTAAAATGG TTAGGTGTCG TAGCAACGTT AATGATGACA TTTGTACAAC 24 60 

TTGGTGGAGC CTTAGTTACC AAAAC CGG AT CAGCTGATGG TTGTGGTTCT TCTTGGCCAC 2520 

TATGTCATGG TGCGTTGATT CCAGAATTCT TTCCTATTGA TACGATTATT GAGTTAAGTC 2580

ATAGAGCCGT TTCAGCTTTG TCTTTATTAA TGGTCTTATG GTTAGTTATC ACTGCATGGA 2640 
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w 



TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 276 0 

cATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 282 0 

TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA TATCAAAAAG CCATTAAGAC 288 0 

GTTTAACATG GTTAATGGCA ATCATCATTT ATTGTGGTGT TTATACTGGT GCACTAGTGa 2 94 0 

GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACCA CGATCTTGTA 3 00 0 

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3055 
(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:
TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 6 0 

AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA GAGCGATATT CAAGAAGAAT 120

TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180 
ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC TGAATAACTT TAATGATAAG 24 0 


TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC GACGTTAATG CGTTGTTATC 300 
TCTACACTTT TTAATTTTAT AATAGCGCAA GACTAAACAG ATTGAAATTA GTAACAATAA 360 
AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 420 


AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480 
ATGATAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT TTAAGTTGAA 54 0 

ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600

AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 660 
TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720 

GAATGGTCAA TACTCAATTA TGGAnTTC 748

(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4718 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

ACCTCCGAAT AATTGATTCC ATTAACTTTT TTTGTGaAAA AtTTAAACAG GCGAAGTCTT 6 0 

CAATAAGTGA AGATGTCCAA ATTATAAAAA ATACATTCCA AAAAGAAAAG TTAGGTACAG 120 

TAATTACtAC ycTGGCGCAA GTGGTGGTGT TACGTATAAA CCAATGATGA GTAAAGAAGA 180 

GGCGACTGAA GTTGTTAATG AGGTCATTAC TCTATTAGAA GAGAAAGAAC GTTTGTTAcC 24 0 

TGGCGGATAT TTATTTTTAT CAGATTTGGT AGGTAATCCA TCGCTACTAA ACAAAGTTGG 3 00 

TAAGTTAATT GCCAGTATTT ACATGGAAGA AAAATTAGAT GCTGTTGTTA CCATTGCGAC 360 

AAAAGGTATT TCATTGGCAA ATGCGGTTGC TAATATTTTA AATTTACCAG TAGTAGTGAT 4 20 

TAGAAAAGAC AACAAGGTGA CTGAAGGTTC TACAGTTTCA ATTAATTACG TTTCAGGATC 4 80 

TTCAAGAAAA ATAGAAACAA TGGTACTTTC GAAGAGAACT TTAGCAGAAA ATTCAAATGT 540 

TTTAGTTGTC GATGATTTTA TGAGGGCTGG TGGCTCTATT AATGGTGTTA TGAATTTAAT 6 00 

GAATGAGTTT AAAGCCCATG TAAAAGGGGT ATCAGTACTT GTAGAATCAA AAGAAGTTAA 660 

ACAAAGATTG ATTGAAGATT ATACTTCCTT AGTGAAATTA TCTGATGTAG ATGAATATAA 72 0 

TCaAGAGTTT AACGTAGAAC CTGGCAACAG TTTATCTAAG TTTTCATAAA AGGAGTTTTA 780

GTATTATGAA AATCATTAAC ACAACAAGAT TACCGGAAGC AC TTGGACC A TATTCGCATG 84 0 

CAACAGTTGT GAATGGTATG GTTTATACTT CTGGTCAGAT TCCATTGAAT ATTGATGGAC 900 

ATATCGTAAG CGCTGATGTT CAAGCACAGA CAAAACAAGT TTTAGAAAAT TTAAAGGTTG 960 

TTTTGGAAGA AG CAGG ATCT GATTTGAATT CTGTTGCGAA AGCGACCATT TTCATTAAAG 102 0 

ATATGAATGA TTTCCAAAAA ATAAATGAAG TGTATGGTCA ATATTTTAAT GAACACAAGC 10 80 

CAGCGCGTAG TTGTGTAGAG GTTGCGCGTT TGCCAAAAGA TGTGAAAGTA GAAATTGAAT 114 0 

TAGTAAGTAA AATTAAGGAA TTATAATTTT CGATTAATAT GTTTAATCAA GCTTCTAAAT 12 00 

AAAACAGAGA GATATATACT ATAGGGGGGC TCACTACATG AAAGTGACAG ATGTAAGACT 12 60 

TAGAAAAATA CAAACAGATG GACGAATGAA AGCACTCGTT TCCATTACAT TAGATGAAGC 13 20 

TTTCGTAATT CATGATTTAC GTGTAATTGA AGGAAACTCT GGCTTGTTCG TTGCAATGCC 13 80 

AAGTAAACGT ACACCAGATG GTGAATTCCG CGACATCGcg CATCCTATTA ATTCAGATAT 1440

GAGACAAGAA ATTCAAGATG CAGTGATGAA AGTATATGAT GAAACAGATG AAGTAGTACC 15 00 

AGATAAAAAC GCTACATCAG AAGATTCAGA AG AAG CTTAA TCAATTTTAT ATTTAGCGAT 156 0 

GTAATACATT TGCAATAAGT TGATTTGATA CTGTCGATAA AGCATAAAGC TTTGTCGGCA 1620 

GTTTTTTTAG TTTGTATTAA TGTTTTTTTA TTTTTAATGA AAGGCTAATA AATATATACG 16 80 
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TGaTGCTCGT ATTTTTGAAG TAAGAAAAAA GTTGTTTTTA AAATTACAAC GAATTAAAAA 18 00 

CAATGCCTTT TATATGTTGA AAGAGTATTG CaGATTAAAT TaTAATAATG ACGAaGgTAA 186 0 

AATTTAATGG GGGTTAATGT TCATGCGAAG ACACGCGATA ATTTTGGCAG CAGGTAAAGG 1920

CACAAGAATG AAATCTAAAA AGTATAAAGT GCTACACGAG GTTGCTGGGA AACCTATGGT 198 0 

CGAACATGTA TTGGAAAGTG TGAAAGGCTC TGGTGTCGAT CAAGTTGTAA CCATCGTAGG 204 0 


ACATGGTGCT GAAAGTGTAA AAGGACATTT AGGCGAGCGT TCTTTATACA GTTTTCAAGA 210 0 

GGAACAACTC GGTACTGCGC ATGCatGCAA ATGGCGAAAT CACACTTAGA AGACAAGGAA 2160 

GGTACGACAA TCGTTGTATG TGGTGACACA CCGCTCATCA CAAAGGAAAC ATTAGTAACA 2220 


TTGATTGCGC ATCACGAGGA TGCTAATGCT CAAGCAACTG TATTATCTGC ATCGATTCAA 228 0 

CAACCATATG GATACGGAAG AATCGTTCGA AATGCGTCAG GTCGTTTAGA ACGCATAGTT 234 0 

GAAGAGAAAG ATGCAACGCA AGCTGAAAAG GATATTAATG AAATTAGTTC AGGTATTTTT 2400

GCGTTTAATA ATAAAACGTT GTTTGAAAAA TTAACACAAG TGAAAAATGA TAATGCGCAA 24 60 

GGTGAATATT ACCTCCCTGa TGTATTGTCG TTAATTTTAA ATGATGGCGG CATCGTAGAA 252 0 

GTCTATCGTA CCAATGATGT TGAAGAAATC ATGGGTGTAA ATGATCGTGT AATGCTTAGT 2580

CAGGCTGAGA AGGCGATGCA ACGTCGTACG AATCATTATC ACATGCTAAA TGGTGTGACA 264 0 

ATCATCGATC CTGACAGCAC TTATATTGGT CCAGACGTTA CAATTGGTAG TGATACAGTC 2700 

ATTGAACCAG GCGTACGAAT TAATGGTCGT ACAGAAATTG GCGAAGATGT TGTTATTGGT 2760

CAGTACTCTG AAATTAACAA TAGTACGATT GAAAATGGTG CATGTATTCA ACAGTCTGTT 2820 

GTTAATGATG CTAG CGTAGG AG CG AAT ACT AAGGTCGGAC CGTTTGCGCA ATTGAGACCA 2 88 0 


GGCGCGCAAT TAGGTGCAGA TGTTAAGGTT GGAAATTTTG TAGAAATTAA AAAAGCAGAT 294 0 

CTTAAAGATG GTGCCAAGGT TTCACATTTA AGTTATATTG G CG AT GCTGT AATTGGCGAA 3000 

CGTACTAATA TTGGTTGCGG AACGATTACA GTTAACTATG ATGGTGAAAA TAAATTTAAA 3060 


AcTATCGTCG GCAAAGATTC ATTTGTAGGT TGCAATGTTA ATTTAGTAGC ACCTGTAACA 312 0 

ATTGGTGATG ATGTATTGGT GGCAGCTGGT TCCACAATCA CAGATGACGT ACCAAATGAC 3180 

AGTTTAGCTG TGGCAAGAGC AAGAGAAACA ACAAAAGAAG GATATAGGAA ATAATCATTT 3240

ACGTATTTAA AATGGCTAGG ATAAAAGGAT AATCCTATGT AATATTAATG TAATCTTTAT 3300 

GATTTAATGA TTCGCATAGT AATGGAGTTA CATyTTATAT ATAATAGTAA TTGCGTAAGT 3360 

AAATAATTGG AGGACTATAA ATGTTAAATA ATGAATATAA GAATTCGTCA TTAAAGATTT 3420

TTTCATTGAA AGGAAACGAA GCATTAGCGC AAGAAGTTGC TGACCAAGTA GGAATTGAAC 34 80 
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GTATTCGTGG TTGTGACGTA TTTATTATTC AACCAACATC ATATCCTGTG AATCTACATT 3 6 00 

TAATGGAATT ATTAATTATG ATTGATGCTT GTAAACGTGC TTCTGCAGCA ACAATCAATA 3660 

TTGTAGTGCC ATATTATGGA TATGCAAGAC AAGATAGAAA AGCCCGTAGC CGTGAGCCAA 3 72 0 

TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGctGG CGCAACTCGT ATGATTGCGT 3780

TAGACTTACA TGCACCACAA ATTCAAGGAT TCTTTGATAT TCCAATTGAC CACTTAATGG 3 84 0 

GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA TATTAACCCA GAAGAATGTG 3 900 

TCGTTGTTcA CCAGACCATG GCGGsnTTAC ACGTGCACGT AAATTAG CTG ACATTTTAAA 3 96 0 

AACTCCAATT GCAATTATAG ATAAACGTCG TCCTAGACCA AATGTTGCTG AAGTGATGAA 4 02 0 

CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT GACGATATTA TTGATACAGC 4080

AGGTACAATC ACTTTAGCTG CACAAGCATT AAAAGATAAA GGTGCTAAAG AAGTATATGC 414 0 

TTGTTGTACA CACCCTGTTT TATCAGGACC GG CTAAAGAA CGTATCGAAA ATTCTGCTAT 4200 

AAAAGAATTA ATCGTAACAA ACTCAATTCA TTTAGATGAA GATCGCAAAC CATCTAACAC 4260

TAAAGAATTA TCTGTTGCTG GTTTAATCGC ACAAGCTATC ATTCGTGTAT ACGAAAGAGA 4 320 

ATCAGTTAGC GTATTATTTG ACTAATATTT AAAAGGCGTT TGACGAACAT ATTCCAAACG 4380

TGTATAATAG TTTCGTTCGT GATTATACGA ATAAATAAAC ACTTGCAAGC AACGATGATG 444 0 

TTGATGGGTA AGTGAGGTGC TCGTTTTGAG CAAAAATGAA AGGTGG AAA T GAGAATGGCT 4 5 00 

TCATTAAAGT CAATCATCCG TCAAGGTAAA CAAACACGTT CAGATCTTAA ACAATTAAGA 4 56 0 

AAATCTGGTA AAGTACCAGC AGTAGTATAC GGTTACGGTA CTAAAAACGT GTCAGTTAAA 4 620 

GTTGATGAAG TAGAATTCAT CAAAGTTATC CGTGAAGTAG GTCGTAACGG TGTTATCGAA 4680

TTAGGCGTTG GTTCTAAAAC TATCAAAGTT ATGGTTGC 4 718 
(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA 6 0 

TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 120 

ATTTCTTGGT A TTTT CAT CG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 180 
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TGCTGTTTCC ATCGCATATG GATTTAGTAA GTTCTTTATG GGTACTGTAA gTGATCGGAG 300

CAATGCTCGG ATATTCTTAG TTCTTGGATT AGCACTCACT GCTATCGTCA ATTTGTTAAT 360

GGGATTTGTA CCGTTCTTTA CATCAGGTAT CGGTATTATG TTTGTCCTAT TATTCTTAAA 420

TGGATGGTTT CAAGGTATGG GCTGGCCACC TTCAGGCCGT GTTCTCGTTC ACTGGTTTAG 480

TGTAAGTGAA CGCGGAAGTA AGACTGCCCT TTGGAACGTT GCGCATAATG TTGGTGGAGG 540

TATTATGGCA CCTATTGCTG CTTGGGGTAT TACAACAACT GCATTTATCA ACTTTGGTTA 600

TTTAAAAGGT TTCGAAGGTG TATTCATTTA CCCTGCACTC TTAGCACTTA TCATTGCCGC 660

AATTTCATAC GTATTGATTA GAGACACACC TCAATCTCAA GGTTTACCTC CAATCGAAAT 720

TTATAAAAAT GACTTTGCTA CAAGCGATAA GAAAACATTA GAAACAGAAT TAACTACAAA 780

AGAAATTTTA TTTAAATATG TACTGAACAA TAAATGGGTA TGGGCAATTG CCTTTGCAAA 840

TATATTTGTT TATTTCGTGC GTTATGGTGT ACTTGATTGG GCGCCAGTCT ACTTAAGTGA 900

AGAAAAACAT TTCGACTTAA AAGCATCAGG TTGGGCATAC TTCTTATACG AATGGGCTGG 960

AATTCCTGGT ACATTATTAT GTGGTTACAT TTCTGATAAA TTATTCAAAG GTCGTCGTGG 1020

ACCTGCAGGT TTCTTCTTTA TGTTAGGTGT CACAGTATTT GTATTAATTT ATTGGTTAAA 1080

TCCTCCAGGC AATGCTTGGT TAGACAATGT CTCATTAATT GCCATTGGTT TCTTAATATA 1140

TGGACCAGTT ATGTTAATTG GTTTACAAGC ATTAGATTAT GTACCTAAAA AAGCAGCTGG 1200

CACAgcAGCT GGATTAACAG GATTATTTGG TTATCTGTTT GGTGCTGTAA TGGCCAACAT 1260

CGTCTTAGGT GCTGTAGTTG ATAAATTCGG ATGGGATGTC GGTTTTATTT TATTAACAGC 1320

AATTAGTGTG TTTGCAATGT TGAGCTTTAT CCTCACTTGG AATAAAGTAG GACAAGAAAC 1380

CGTTCATCAT TAAATGATAA AAAATAAAGT CATATGGTTA TCTTATCGAA AGATGATATA 1440

TTCATCTCTT ATAAGTTCAA CCATATGACT TTTTATTAGT ATTCAAAAAA ATATTTACAT 1500

TGCCACTTTT GTGTTTGCCC TGCTGTTTTA TTCAATTGAT TACACCACTT AGGATAAACT 1560

CTAAAAGCCA TTTTCCCTTG ATACTTAGAT GAAGCTAAGA TACCTTTTTC TACCAATAAT 1620

TCCCTAGGAA ATAGAAAGTA GCCATTTAAT TCATCATCGA TAACAGCAAC AATCAGGTAA 1680

TCAGCAAACT CTTCAATTTT GTATGGCCGA TTATAATTAT CtTCGTCTTT TGTCCAACAT 1740

GTCACGAAAT ATCCCGATTT AGTCGGTGTT TTCTTAGCTA ACCTACTTTG ATATGTTTCC 1800

TCTTTAAAGC TAAATGTTAA TGCyTCGTAA TCTTGATTAT ATTTTTCTTC AGTTAAATCT 1860

TTAACTTCTG ATTCTTCACT AAAAATATTT TTCAACAGTA TTTTAGATTT ACACATATGC 1920

CAATCTCACT TTATTTTTTC ATAATCGTAT CATATATTTA TTTTTTTCGA AAAATACACT 1980
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TGTATCAACT GTCAAACGAT CACGTAATAA ATAGACGATT AACATTGcTG CTAAAGCACA 2100 

AAGTGACTCA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAGCATTCAT 216 0 

AATAAATAAT ACTGGAATGA TAATTGCACC TTGTAAAATG GCCATAATTG TAGCACCACG 222 0 

ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 22 80 

TGATGCCATT GTTACTTTCA AAAT AAATG t CGCCATCTCA ACAATGGCTT GATCAGTAGT 2 34 0 

AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 24 00 

AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 24 6 0 

AAAG'ITATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 2520 


aAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 2580 

TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 264 0 

AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 2700

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 2 760 

ATTACTGATT GCAGTACCCA AAGCTGcACC AACAACGTTT aAATCAAAAC CAAAAATTAA 2 820 

AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 2880

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 2 94 0 

ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT AACGTTTCAC CTCTCGCCCC 3000 


TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 3060 

TGCAATACCA C CAT AAAT AG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCGC 312 0 

ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 3180 


T 3181 

(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4029 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60 

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT TTATATAAAA AGGTTGCCAA 120 

AGAGCAACTA ATTACTGCAA CAAT AG CG AC CATGATTTGT GATTATTTTT AGAACCACAA 180 






TTATTAATAC ACTATTTTAA GTATCTCTAA TTGATGATTG TATTATAGAA TTAAAATTGA 300

TGGAATGCAT CAACTGGCAT AACAAATACA GTAGCACCGC CAACTTCAAC TTCAACTGGA 360

TATGGAATGT ACGAATCCGC ACTGCCTCCC ATAGGTGTAA TAGGTGAAAC CAACTGTTCT 420

CTATTACCAC ACGTTTGATT AATCACAGAC AATATTTCAT CTACACGGTC ATCATTGACA 480

CCACATAAGA ATGTTGTATT ACCCGCTCTT AAAAACCCAC CTGTTGTTGC CAATTTTGTT 540

GCTCTAAAGT TATTTTTAAC AAGTTGATCT GCAAGTTCCT GACTATCTTG ATCTTGTACG 600

ATCGCTATAA TCATTTTCAT TTTATAACAC CTCTTCTAAC AATTATATCA TATCTTTTCT 660

AAATATTTGA TGATAGTTTG ATACGTGTCT TCAACAACAT TTTCAAGAGG TTGATCTGCA 720

TTAACGCTTT TGAACCGTTG TGATTCATTA TGAATGATTT CTTGGTAACC TTCAATTACT 780

TTTTCGTGAA CTTTAAATCT TCTTGATCTA ATCTATTTTG ATCTCTTGAA TTTTTAATAA 840

TACGTTCGCG ACCTACTTCA GCACTAACAT TTAAATAAAT CGTCAAGTCT GGaTATAATC 900

CATTTATTGC AAATTCGTTT AATGCTCTTA CTTCTTCAAC GCCAATCCCT CTAGCATAAC 960

CTTGATAAGC TAATGAACTA TCGATATAGC GATCACACAA CACAACCTTA CCTTCTTTTA 1020

AAGCTGGTAT GACCTTTAAT ACAAGATGTT CTCTTCTAGA TGCAGCAAAT AACATTGcTT 1080

CaGTtCTAAT GTCCATATCA TTGCCcTTCT AATACAATTT TACGTATTTC TTCACCAGTA 1140

GGAACACCAC CTGGTTCTCT AGTCATAATG ACATCATAAT CTTTTACTAA TCTATGGTAA 1200

ACTTCATTAA TTACAGTTGT TTTTCCAGAG CCTTCTGGGC CCTCAAAAGT TATAAAAGCT 1260

GACATTTTAT TCATCCTCAA CTAAAATTTT ATTATTTTTA ATTCCTTCAA CTATCATTCC 1320

AGTTTCCAGA TATTCATTTA CCAATTCTAT CATATTTTCA GTTATTGTTT CTCCTTTGAA 1380

AATAATAGGA ATcCCTGGcG GATACGGGAC AATATGTCGC GCCAGAACTT TACCTTTgCC 1440

TTTTcAAATC ACACCAAGTA ACATATTCAA AGCGTTTTGG TTTATAGTTA CCTTCAGTyG 1500

TTAAAAGTTG TGTTTGTTTA ACTTTAGAAA CTGATTTTTT CGGTAAAATC ATATCTTCAA 1560

TTTTACGTAA AAGCGAATCA AATAAATACG TATCATCATG ATGCCATAAC GGCAATATCG 1620

CTAATGCTTG ATAGTCGTCC GCTAATTCTA AATAGATATG TGCATTCATA AACCAATTTT 1680

GAATATCATG ACCTGTAAAA CCTTCATATT TTATCAGCAA CTTCAACGGA TCATCAACTT 1740

GAAGCATTTC AAAACCCTTC TTCTCCAAAC ATTCGATTAA TTGCGCTCTC TTATCAAAAA 1800

ACACGGTACT ATCATATGTT TTATAGAACT CGGCAGCTGA CTCTAAACTA GCCATAATCA 1860

AATACGAAGG ACTAGATGTT TGGAAGTAGC TTAGATATTC TATAATAGTT TCTCTATAAG 1920

GTGCATTTTT ATGAATATAA AGTACCGAGC CCATCGTTAA AGCTGGTAAC GTTTTATGAA 1980
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CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT 2100 

TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 2160 

GTTTGTGACC GTCATTATTC AAACGG CTT A AATTAACTTT ATTATAATGA TTCGTTAACG 2220 

GACTTTGATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 2280 

CAGATTTATG TACATTTCTT GCCATTAAGA TATCGCCTTT TTTCTGTGAA AAAGACTGGA 2 34 0 

TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA TAATCTGAAT 24 0 0 

GTTTCTCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 24 6 0 

AATCCAGGTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 2 520 

ATATTTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT CTTGATTTAA ACTTTCTAAT 2580

TTATTTAAAA TAGGTTGCTT CATGATATAC GCTTCCTTTA TTTACACTGT TTTGGAATTA 264 0 

GTTACTTTCA AAAGTATTAA TTATATAGTA ACACTTCTTT GACAAAAGTT AGTGTTACTT 2 70 0 

ATGCAATAGC TTGTCTATTG TATAATAATT AATTTCCTTT TTGTACTTCG ATTTAAAAGA 2760 

TATTAGACAT AAAATCTAAA AACAGCAGTA AGATGATTTA TGATTAAAAA CTATCTTACT 2 820 

GCTGTTCACT TTTTATAATA CTT CTGAATG TCTTCACTTA TACTTCTAGT CACAGATTTA 2 88 0 

AATAATCAAA AG TGC A CATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 2 94 0 

TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCGCT 3 00 0 

AAGAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 3 06 0 

ACTCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA TTCAAACGTT 312 0 

TTCACTTCGC CAAGCCATTT TTCTTTGTGT TT A CTTTTT A TTTTG A CGTT TT AG ACATAA 3180 


AAAAAGAGAC cTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 324 0 

TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGAC AATCGCTTGC TTCTTTCCTC 3300 

TCCTTCGGCT CTCGCTTACT CATTTAGCTC TACTAAACTC GTTGCGCTCT TTTCTCGTTT 3 3 60 


CGTCAGATTC AAACGTTTTC ACTTCGCCAA GCCATTTTTC TTTGTGTTTA CTTTTTATTT 3420 

TGACGTTTTA GrCATAAAAA AAAGAGACcT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 34 80 

TTTTTGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GwCGCTAAgA 3540

aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 36 00 

ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTTC GTCAGATTCA AACGTTTTCA 3 660 

CTTCGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTTT GACGTTTTAG ACATAAAAAA 3720

AAGAGACCTT GCGGTCTCAA TGCGGCTCAT CGCATCCATT TTTTGCCTGG CAACGTTCTA 3780 
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CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3 9 00 

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3 960 

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCACATGT GCACCATGCT TGCCACCTCG 4 0 20 

GAACCTATT 4 0 29 



(2) INFORMATION FOR SEQ ID NO: 302: 


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:

GATGCAATAG TTGAGTAGTT ATAAGTAGCT ACATACAACC GCTCAAATAT AGGTTCAAGA 60

ACATTTTTAA TGAAGAAAGC GATTTTCCCm CAGAGTGCAA ACGCTTGACT CAATAACGAA 120 

ATGTCACCTA TGTATGGCGT GACTTATTAT ACAGGaGGTG CAAAGTATGT TTGCTATTAT 180 

TGAAACAGGT GGAAAACAAA TCAAAGTAGA AGAAGGTCAA GAAATCTTCG TTGAAAAATT 240

AGACGTAAAC GAAGGAGATA CTTTTACATT TGATAAAGTA TTATTTGTAG G T GG AG AT T C 3 00 

AGTTAAAGTT GGAGCGCCAA CAGTTGAAGG TGCAACAGTT ACTGCTACTG TTAATAAACA 3 60 


AGGTCGCGGT AAAAAAATCA CTGTATTCAC ATACAAACGT CGTAAAAATT CAAAACGTAA 420 

AAAAGGCCAT CGTCAACCAT ACACTAAATT AACAATCGAT AAAATCAACG CGTAATTATT 4 80 

ATGATTACTG TTGATATTAC AGTTAATGAT GAAGGCAAAG TAACAGACGT T ATT AT GG AT 54 0 


GGCCATGCTG aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 600 

TTGTTTGGTA GTGTTAATGC GATTATAGGA TTGACATCTG AGAGACCAGA TATCAATTAT 6 60 

GACGACAATG GTGGTCATTT TCATATAAGA AGCGTTGATA CAAACAACGA TGAAGCGCAA 720 


CTAATTCTTC AAACAATGCT TGTGTCTTTA CAAACTATTG AAGAAGAATA TAATGAGAAT 78 0 

ATTAGATTAA ATTATAAGTG AGGTGCATTC CGATGTTAAA ATTAAACTTA CAATTCTTCG 84 0 

CATCTAAAAA AGGGGTAAGT TCTACAAAAA ACGGACGTGA CTCTGAATCA AAACGCTTAG 900

GTGCTAAACG TGCTGACGGT CAATTCGTAA CAGGTGGTTC AATTTTATAT CGCCAACGTG 960 

GTACTAAAAT TTACCCTGGT GAAAATGTAG GTCGTGGTGG CGATGATACA TTATTCGCTA 1020 

AAATCGACGG CGTTGTTAAA TTCGAACGTA AAGGTCGCGA CAAAAAACAA GTTTCTGTAT 1080

ATGCAGTAGC TGAATAATTT TGTCTAGTTA ACACCAGAAG TGAATCTTCT GGTGTTCTTT 114 0 
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AGACGTTATA CTAAATGTGC AcTGTATAAG AG CCCCT AAT CACTAAACTA TAAGGGGGAC 12 6 0 

AAAGGAATAC AGTTGCAGCG TTTAAAGAAT AAACTGTACC ACAATTGGTG CTGAGAAATA 13 20 


TAAGTATT7T AAAGCAAAGA TTTATAAAAG TAACTGCATA AGAGCCCCTA ATTATTTACA 1380

ATATATAAGG GGCTCTAATA TGCTATAATT ATTGGGAAAA TGAAAATTAT ATGTAAAAGA 144 0 

GGTGAGATAT ATGTTTGTCG ATCAAGTCAA AATATCTCTT AAAGCCGGTG ATGGTGGTAA 15 00 


TGGTATTACC GCATACAGAA GAGAAAAATA TGTACCATTT GGTGGACCAG CTGGCGGTGA 1560 

CGGTGGTAAA GGTGCTTCAG TCGTATTTGA AGTGGATGAA GGTTTAAGAA CGTTATTAGA 16 20 

TTTTAGATAT CAACGTCATT TTAAAGCAAG CAAAGGTGAA AATGGCCAAA GTAGTAATAT 1680

GCATGGTAAA AATGCGGAAG ATTTAGTATT AAAAGTTCCA CCTGGTACAA TTATTAAAAA 174 0 

TGTTGAAACA GACGAAgTGT TAGCAGATCT TGTTGAAGAT GGTCAAAGAG CTGTAGTAGC 1800 

GAAGGGCGGT CGAGGTGGCC GAGGTAATTC ACGTTTTGCA ACACCTAGAA ACCCTGCACC 1860

TGACTTCAGT GAAAAAGGTG AACCAGGTGA GGAATTAGAT GTATCTTTAG AATTGAAATT 1920 

ATTAGCTGAT G T AGG ATT AG TAGGTTTCCC TAGTGTGGGT AAATCGACTT TATTATCTAT 19 80 

CGTTTCAAAA GCTAAGCCTA AAATTGGGGC ATATCATTTT ACAACGATTA AACCAAATCT 2040

AGGTGTTGTT TCAACGCCTG ATCAACGTAG TTTTGTTATG G C AG AT TT AG CAGGTTTAAT 2100 

TGAAGGTGCA TCTGATGGCG TTGGATTAGG ACATCAATTT TTAAGACATG TAGAGAGAAC 2160 


AAAAGTTATT GTTCAGATGA TTGATATGAG CGGTTCTGAA GGTAGAGAAC CTATTGAAGA 2 22 0 

TTATAAAGTC ATTAATCAAG AATTAGCTGC GTACGAGCAA CGTTTAGAAG AT AG AC GTC A 22 80 

AATCGTAGTA GCTAACAAGA TGGATTTACC TGAATCACAA GATAATTTAA ACTTGTTTAA 2 34 0 


AGAAGAAATT GGCGAAGATG TGCCAGTTAT TCCAGTTTCA ACAATAACGC GTGATAATAT 240 0 

TGATCAATTA TTATATGCAA TAGCAGATAA ATTAGAAGAA TATAAAGATG TTGACTTCAC 24 60 

AGTTGAAGAA GAGGAGTCAG TTGGCATTAA CCGAGTATTA TATAAACATA CACCGTCACA 2520 


AGATAAATTT ACAATTTCAA GAGATGATGA TGGTGCTTAT GTGGTAAGTG GTAATGCTAT 2580 

TGAAAGAATG TTTAAAATGA CTGACTTTAA CAGTGATCCA GCAGTACGTC GATTTGCTCG 264 0 

TCAAATGCGT TCGATGGGTA TTGATGATGC GCTTAGAGAA CGTGGTTGTA AAAATGGTGA 2700

TATCGTTAGA ATTCTTGGCG GAGAATTTGA ATTCGTTGAA TAGGAGCGAA ACATGATGGA 2760 

CAATAAAGAT TATAAAAAGT TTTATTTAAT TAGAGAAGAT GTCTTGCCTG AATCCGTGGT 2820 

TAAAACATTG AAGATTAAAG ATGCCTTAAA AAGTGATCCG ACATTGTCCA TTTATGATGC 2880

CGTTAAACAG TTTGATCTAT CTAGAAGTGC TTTTTATAAA TATAGAGAAA CGATATTTCC 2 94 0 
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TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 3060

GATTCATCAA AGTATTCCAA TGGAAGAAAA AGCAACAATA ACATTATCAC TGAATGCTAA 3120 


ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 3180 

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 324 0 

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GGTGTTGGTT 3 3 00 


ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3 3 60 

TTCATACATC TTTAATTGTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420 

AAGAGAAAGA TATGTTCTTG AGTTTAATTA AAGTTACTGG TATTGGTCCG AAATCAGCTT 3480

TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 354 0 

ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 36 00 

ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 3660

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 3720 

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 3780

CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTTGT ATCTTAATTT TAAATAGATT 3840

AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3 900 

ACTGATTT CG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3 960 


ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4 020 

CATGTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 4080

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 414 0 


TTGGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200 

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATTT CTTTTTAGAT 42 60 

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC T C CATTCA CT 4 32 0 


TTGGTAGGTG CAACAACGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 4380

GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 444 0 

GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACGTTCT 4500

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4560 

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4 620 

GATCAACACG GACTAGATTA CATTGATCAT AAAATGATGA ACTGTATTAT TAAGCAGTAT 4680

AATGGCGGAC CTGTTGGTTT AGATACGATT GCCGTAACAA TTGGTGAAGA ACGTATTACA 474 0 
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GGCAGAAAAG CAACACCATT AGCTTATGAA CATTTTGCAA AGTCGAATGA GGAGAGAGAA 486 0 

TAACGTGAAT ATTGAAGAAT TTGACTATGA CTTACCAGAA TCATTAATTG CTCAAACGCC 4 92 0 


TTTAAAAGAT CGTGATCATA GTCGTTTATT AGTCATGGAT AGAGAAACTG GTGAAATGAA 4 980 

ACATTTACAT TTCAAAGATA TCATTGAGTA TTTTAGACCT GGTGATACAT TAGTGCTTAA 504 0 

CGATACGCGA GTAATGCCAG CTAGACTTTT TGGTTTAAAA GAAGAAACTG GTGCAAAAGT 5100 


TGAAATGTTA ATGTtAACTC AAATTGAAGG TAATGATTGG GAAGTCTTAC TGAAACCAGC 5160 

TAAGCGTATT AAAGTTGGTA ATAAATTGAA TTTTGGTAAT GGCAAAATTA TAGCTGAATG 5220 

CATAAAAGAA ATGGATCAAG GTGGACGCAT CATGCGTTTA CATTATGAAG GTATTTTACA 5280

AGAAAGATTA GATGAATTAG GGGAAATGCC ACTGCCACCA TACATCAAAG AACGTTTAGA 53 4 0 

TGATCCAGAT CGTTATCAAA CAGTTTACGC TAAAGAAAGT GGTTCAGCGG CAGCACCAAC 54 00 

AGCAGGATTA CATTTTACTG ATGAGTTATT AATTGAAATT AAAAATAAAG GTGTTAATAT 5460

CGCATTTGTT ACATTACATG TTGGGTTAGG TACGTTTAGA CCGGTGAGCG TAGACGATGT 5520 

GAATGACCAC GAAATGCATA GTGAATATTA TCAAATGACm CAAGAAACAG CTGATTTATT 55 80 

AAATGATACT AAGcCAAAGG ACATCGCATT ATATCAGTTG GTAC7\ACTTC AACACGTACA 5640

CTTGAAACAA TTCGACGCGA TCATGATAAA TTTGTTGAAA CGAGTGGCTG GACTAATATA 57 00 

TTTATTTATC CAGGATTTGA TTTTAAAGCA ATTGATGGCC AGATTACTAA TTTTCATTTA 5760 


CCAAAATCAA CATTAGTTAT GCTAGTATCA GCGTTTAGTA GTCGTGAAAA TGTTCTGAAT 5820 

GCTTATAAAA CGGCAGTAAA TTTAGAATAT AGATTCTTTA GTTTTGGCGA TGCAATGTTA 58 80 

ATTATATAAA AAGAATGTGA GGATTTTGAA TATGCCTGCA GTAACATACG AACACATTAA 594 0 


AACTTGTAAA CAATCAGGTG CGCGTTTAGG TATCGTGCAC ACACCACACG GTTCATTTGA 60 00 

AACXCCTATG TTTATGCCAG TTGGTACTAA AGCAACCGTT AAAACAATGA GTCCAGAAGA 6 060 

GTTAAGACAA ATTGAAGCAA AAATCATTTT GGGCAACACA TATCATTTGT GGTTACAACC 6120

CGGAAATGAT ATTATCAAAC ACGCTGGGGG ATTACATAAA TTCATGAATT GGGATGGTCC 6180 

GATTCTTACA GATTCAGGCG GTTTCCAAGT GTTTAGTTTA AGTAATTTAC GTAAAATTAC 624 0 

AGAAGAAGGC GTGGAATTTA GACATCATAC TAATGGGTCT AAATTATTTT TGAGTCCTGA 6300

GAAATCAATG CAAATTCAAA ATGATTTAGG ATCTGATATT ATGATGGCAT TTGATGAATG 6 360 

TCCACCGATG CCTGCTGAAT ATGATTATGT AAAAAAATCT ATTGAACGTA CAACACGTTG 6420 

GGCGAAAAGA TGTCTAGATG CACACCAAAG ACCTGAAGAT CAAGCATTGT TCGGCATTAT 6480

ACAAGGTGGC GAATATGAAG ATTTAAGAGA ACAAAGTGCA AAGGATTTAG TAGAATTAGA 654 0 
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AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 66 6 0 

AGGATCTCCa GATGCGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 6 72 0 

CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 6780

TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG AC TGTTATAC 6 84 0 

ATGTCAAAAC T ATT CAAG AG CGTATATACG TCATTTAATC AAGGCAGAGG AAACTTTTGG 6 900 


TATTCGTCTT ACTACTATTC ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 6 960 

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 7020 

ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGAC CACA 7140 

ACAAAAACTG TGCGAAACA 7159 

(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

TCCATTTATA GAAATTTCTA AAGCAGAnAA TAAGATAGAA GATATCGGCC AAGGTGCAGA 60

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA 120

ATTTGATTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG 180

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT 240

GATATGGCCA TTTCAATATA ATATCAGTTT GArAACGAAA GACTCTAATG TTGATTTAAT 300

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA 360

TATCGGCGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC 420

TAAAACAATT AGTTATAATC AAAAAAACTA TGTTACTGAA GTAGAAAGTC AGAACTCTAA 480

AGGTGTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC 540

ATATGATCAA TACTTATTTG CACAAGACCC AACTGGTCCA GCAGCACGAG ACTATTTCGT 600

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC 660

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAGTTT GAAATcACTT ACGGCAGAAA 720

CATGGATGCT ACATATGCTT ACGTGACAAG ACATCGTTTA GCCGTTGATA GAAAACATGA 780
AAAAATTAAA AGCATCACAC CTAAGTAAAC AGTTCaATCA TCTTAAAAAA TCCTGGGACA 900

CTTCATACTT GTCTCAGGAT TTTTTmACAA ATTGAATCAG CCTCATAACA TTAAATTATT 9 60 

TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG TATCGAmCCA 10 20 

TAGTAGATAC ACAAATAATA CAAATGAAAC AATTTAACTT GAAAGCTTAa ATAAATATTA 10 80 

TCAAGTTAAT AAACAATTAA TTTTTAGATG GATTCATCAA AAAT CGT AAA AAAGCACAAT 114 0 

TTGTATTTTA CAAACATTAA TTAAAAAAGA AAGCAAGACA TTCGTGCAAT CGGTTACCTT 12 00 

AAATTGTTTA CAACTGTCAA CAATACCAAG GTTTTATTAA CTATATTTCT CACAAAATTA 12 6 0 

GCTTTTAGCA TTCCAAACAA AAAAGGTTAA ATCGAACGGA ATTATGGCAT TTTTAACTTA 13 20 

ATTGTAAAAA AAGTTGATAA TGGTCAATTG TTAATGAACA GTTAATTATA ATAACGCCCA 13 80 

AAATATATTA TTATTTAATT AAGTTAAATA AAATTATAGA AAGAAAGTGA AACTTATGCT 144 0 

TAAAAATAAA ATATTAACTA CAACTTTATC TGTGAGCTTA CTTGCCCCTC TTGCCAATCC 1500 

GTTATTAGAA AATGctAAAG CTGCTAACGA TACTGAAGAC ATCGGTAAAG GAAGCGATAT 1560 

AGAAATTATC AAAAGGACAG AAGATAAAAC AAGTAATAAA TGGGGCGTGA CTCAAAATAT 162 0 

TCAATTTGAT TTTGTAAAGG ATAAAAAATA TAACAAAGAT G CTTTG AT AT TAAAGATGCA 16 80 

AGGATTCATT AGCTCTAGAA CAACATATTA CAACTATAAA AAAACTAATC ATGTTAAAGC 174 0 

TATGCGATGG CCATTCCAAT aTaATATTGG TTTAAAAACA AATGATAAAT ATGTTTCTTT 1800 

AATTAATTAT TTAC CTAAAA ATAAAATTGA ATCTACAAAC GTGAGTCAGA CATTAGGATA 1860 

CAATATCGGT GGTAATTTCC AATCAGCCCC ATCACTCGGT GGTAATGGAT CATTTAACTA 1920 

TTCTAAATCG ATTAGCTATA CACAACAAAA TTATGTAAGT GAAGTAGAAC AACAAAACTC 1980 

AAAAAGTGTT TTATGGGGCG TCAAAGCGAA TTCATTCGCC ACTGAATCAG GTCAAAAATC 204 0 

AG C CTTTG AT AGCGATTTAT TTGTAGGCTA CAAACCTCAT AGTAAAGATC CTAGAGATTA 2100 

40 TTTCGTTCCA GACAGTGAGT TACCACCTCT TGTACAAAGT GGATTTAACC CTTCATTTAT 2160 

CGCCACAGTA TCTCATGAAA AAGGTTCAAG CG ATACAAGC GAATTTGAAA TTACTTACGG 2220 

AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATGGCA ACAGTTATTT 2280 

AGACGGACAT AGAGTCCATA ATGCATTTGT AAATAGAAAC TATACTGTGA AATACGAGGT 2340 

CAATTGGAAG ACTCATGAAA TCAAGGTGAA AGGACAGAAT TGATATGAAA ATGAATAAAT 2400 

TAGTCAAATC ATCCGTTGCT ACATCTATGG CATTATTATT ACTTTCTGGT ACTGCTAATG 2460 

CTGAAGGTAA AATAACACCA GTCAGCGTAA AAAAAGTCGA TGACAAAGTT ACTTTATACA 2520 

AAACAACAGC CACAGCAGAT TCTGATAAAT TTAAAATTTC ACAGATTTTA ACATTTAATT 2580 

ACTCAGGCTT TGTGnAACCT AATCCTAATG ACTATGACTT TTCAAAATTA TATTGGGGAG 2 7 00 

CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC AGTAAACGTC GTTGATTATG 276 0 

CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC TTTAGGCTAT ACATTTGGTG 2 820 

GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA TGGAAATACA GCTTTTTCTG 2880

AAACAATTAA TTATAAACAA GAAAGTTACA GAACAACATT AAGTCGCAAC ACAAATTATA 2 94 0 

AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA TAATGGTTGG GGACCTTATG 3 0 00 

GAAGAGATAG CTTCCACCCA ACATATGGTA ATGAACTCTT CTTAGCTGGC AGACAAAGCA 3 060 

GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT GCCATTATTA TCTAGAAGTA 3120 

ACTTCAATCC AGAATTTTTA AGCGTACTAT CACACAGAC 3159 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3821 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

GCAAAAACTT TTCTCCAACT ATGTCAGACT ATAATGGCAT GAACAAAATT GATATGATGA 60 

ATCAAATTAA AGTTGATACG ATGTTGCATG GTTATCACGC TGGATTTTTA TTTGCATTAT 120 

TGATTACAGT TGTTAGTTTC TTTTGTTCAT TTATGTTACA AGGTAAGAAA AAAGAAGTTG 180 

ATTCTCGTCA ATAAATATAA GTTGCTAGCT ATATAAAGCT TTTTAGCAAA AGTTCAACAT 24 0 

ATTGACTTAT CCGGCATTTC AGATTAAAAT ATTTTTATTC CGATTAGAAT AATAAGAATA 3 00 

AGGAGATATA TTCTATGAAA AGACTTTTAT TTGTGATGAT AGCTTTCGTT TTCATATTGG 3 60 

CTGCATGCGG AAACAATTCG TCGAAAGACA AGGAAGCTAG TAAAGATAGC AAGACAATTA 4 20 

ATGTTGGGAC TGAGGGGACT TATGCACCAT TTAGTTTCCA CGATAAAGAT GGTAAATTAA 4 80 

CTGGTTACGA TATTGATGTT ATTAAAGCAG TGGCTAAAGA AGAAGGTTTA AAACTTAAAT 54 0 

TTAATGAAAC TTCTTGGGAT TCTATGTTTG CAGGTTTAGA CGCAGGGCGT TTTGATGTAA 600 

TCGCGAACCA AGTAGGTATT AATC CTGATA GAGAAAAGAA ATATAAATTT TCTAAGCCTT 66 0 

ACACATTCTC AAGTGCTGTT TTAGTTATTC GTGAAAATGA AAAAGATATT AAAGATTTTG 720 

ATGATGTTAA AGGTAAGAAG TTAGCACAAA CATTCACATC TAATTATGGT AAATT AG CT A 78 0 

AGGATAAAGG TGCTGATATT ACAAAAGTTG ATGGCTTTAA CCAATCAATG GATTTATTAT 84 0 
AAAAACCTAA TGcTAAGATC AAAGCAATCA AAGGTAATGC TGAACAAAGT AGATCTGCAT 960

TTGCATTTTC TAAAAAAGCA GATGATGAAA CAGTTCAAAA ATTCAATGAT GGCTTGAAAA 102 0 

5 AAATCGAGGA AAACGGTGAA TTAGCTAAAA TAGGTAAGAA ATGGTTTGGT CAAGATGTTT 10 8 0 

CTAAATCTAA ATAGCGAACA ACAACATGCG CTAGATGCTG CAAAACAAGC TTTCGGACCT 114 0 

ATGCTAGAAG GTTTGGTCAA ATATTCAATT CCTATTACAT TAGTTACATT TGTTTTAGGA 12 00 

TTGATTATTG CATTATTTAC AGCATTAATG CGAATTTCAA CGAGTAAAAT TTTAAGAAGT 1260 

ATTTCACGTG TCTATGTATC TATTATTCGA GGAACACCAA TGATAGTACA ACTATTTATC 13 20 

ATATTTTATG GTATTCCAGA ATTAGGTAGA TTATTAACAA ATGACGCTGA CAACCAATGG 13 80 

ACATTGGCAC CTGTAGTGGC TGCTATTATT GG TTTATCAT TAAATGTAGG TGCGTATGCT 144 0 

TCGGAAATTA TTCGTGGCGG TATTATTTCT ATACCGAAAG GACAAACAGA AGCTGCaTAT 1500 

20 TcCaTCGGTA TGACGTATGG TCAAACGATA CAACGTATCA TTTTACCGCA GGCAATTCGA 156 0 

GTGTCGATTC CTGCACTAGG TAATACATTT TTAAGTTTAA TC AAA GAT AC ATCATTATTA 1620 

GGATTTATTT TAGTGGCTGA AATGTTTAGA AAAGCTCAAG AAGTTGCGTC TACAACATAT 16 8 0 

GAATATTTAA CAATTTATGT GTTAGTTGCG CTAATGTACT GGGTGGTATG CTTTATTATT 174 0 

TCAATTATCC AAGGTATCTA TGAATCTTAT ATTGAAAGAG GGTATCGCTC ATGATTCAAT 1800 

TGAACAATAT CCATAAATCA TTTAATGATG TTGAAGTCAT CAAAGGTATT GATTTATCTG 1860 

TTGAACAAGG TGAGGTTGTA ACCTTAATCG GTCGATCTGG TTCAGGTAAA ACAACATTGT 192 0 

TACGTATGAT TAATGCATTA GAAATTCCAA CTGAAGGTAC AGTTTATGTT AACGGCAAAA 1980

35 CATATACATC TAAAGATAAA AAATCACAAA TAGAAGTTCG TAAACAGTCT GGTATGGTAT 2 04 0 

TTCAAAGTTA TAACCTTTTT CCGCATAAGA CGGCATTAGA AAATGTAATG GAAGGTCTTA 2100 

TCACAGTTAA AAAGTTGAAA AAGGATGAGG CACGTGGGAA ATCACTTGAG TTACTTGAGA 2160 

40 AAGTTGGTTT AACACATGTC AAAGATCAAC GTCCACATGC ATTATCAGGT GGTCAACAAC 2 220 

AACGTGTTGC TATTGTCAAG AGCACTAGCA ATGAACCCTA AAGTGATGTT GTTTGATGAA 2 2 80 

CCAACATCTG CACTTGATCC TGAACTTGTG AATGATGTTT TAAAGGTTAT TAAAGATTTG 2340 

GCTAATGAAG GCATGACAAT GGTCATTGTG ACACATGAAA TGCGTTTTGC TAAAGAAGTA 24 00 

TCTAATAACA TTGTATTTAT TCmTGAAGGC ATGATCGGAG AACAAGGGGC TCCAGAAGAG 24 60 

ATGTTCAATC GTCCGAAAAC AGAAGAATTA AGACGTTTCT TAAATGTTAT AAATGAAGAA 2520

TAATCAAATA GAACCACGTA TCATGTTTTA GTATGGCGAT GAAGCCATAT ACATGATGCG 2580 

TGGTTCTTTG TTATGTTGTC ATAATCTTGG AGCGATATTT TAACGACGTT TATGATTTAA 264 0 
TTCTACATGT GCGTTAAAAC CTTTTTTGAA TTGTTGGACG CCATAGTCTT CTGATGACTC 2760

TGAAAAGTCA CCGGTAATAC CATAAAAATT AT AG CG ATCA ATATGATGCG CTTTAGCAAA 2820 

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 2880 

ACCACTAGAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2 94 0 

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG TCGATTTTAC GCTTATTGCT 3 000 

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 3 06 0 

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AGCTGTTAAT TGTTGTTGCT TTAATTGTAA 312 0 

CGTTTTTAAA TACTCGTTTA AATCAATATA CGCCAACTTT AACATGGCGT GGTCATCGTA 3180

TAACTTT T GC ATTTCTTCAA AGTATGGTAA CTCACGGAAT TTGAAACCGT GCTTTTCCTC 3 24 0 

AGCCATATGG AATAAGTCGA AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300 

20 GACACCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 33 6 0 

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AGCCAACGGA TTTGGCTCAT 342 0 

TGAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 34 8 0 

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT GCATTGCGTA AATTTTCAAT 3 54 0 

TAAATATGGA TCTACAAGGA CATATAAACA ATTGTGTTTC TTTAAATATG ACGTTAATGC 3 600 

TTTAAAGAAA AATGCTACTA ATGATTGATT TGTATAATCC ATCACTGGCC CGCGATGTGT 3 660 

ATAAAAATAT TTGAAAAATT TAAGTGTGCG TGCTTCTGTC AATAAG CATC CTGCAATCAC 3 720 

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTTAAAT CAACTCTATG 3 78 0 

ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3821

(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1422 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 
GCGACACCAT TGTTTGCAGC AACTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60 
GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 180 
CTTCTTAAAT ATCCATGTGG ATcGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 24 0 

TATTGTAATT GTGTAGCTGC ATAGTCAGCA TAGTTATTCA TTGAACGTGC AAATGAAGCA 360

TAGTCGTGTG TGTGTACGAT TTCAACATTG ATGAATCTAG GGTTACCGAC TG CACCG ACA 420 

CCCCAAGATA AGTAATCCGT TGGTGCTGTT TCGATTATAC GATCCCCATC AACAAATGCA 480

TGTACGAATG CGTTTTGATA GTTATTTTTC ATATAACTAA TTTCACCATT TATCGTCGAA 54 0 

CGATCATTAG CTGTATCATG AACTACGATA CCTTCAGGAC GACCTACGCC GTTACGGTAT 6 00 

GCGTATTTAG GGAAGTAAGA TGTATAATCT TCTTCAATTT TAGGTGCTTT TAAGTTATTT 66 0 

TTACAAATGT AATCGTTAAT TGAAGAGTTT ACTTGTGGTT TATATTTTGG CAAACTCGTT 72 0 

TTTGGTGTTG CAGCAACTGA TCTTGGTTGT GCTGAAGCGC TAAAAGTAGT TACTTTAGGT 780

GTCGCTTCAG TTTTAGCTTT AGGTGCTGAT GTAGTTGCAG CTTTAGGTGC TGCGGTTTTA 84 0 

TATTGCGTyT CAAGAGCTGC AGGTTTAGCA GCTGATTTAA TTAATTCTGG ATTAATTTGA 900 

20 TCTTCTGAAT TATCATCTTC ATCATCAACT AAACTATAAC CAGCATTTGT AACATTAGTG 960 

TTAGTTTTAG GTGCTGTAGT GCTTGTTGAC TTTGCAACAG GCTGCGTATT ATTTGTAGTC 102 0 

GCTGATTGAT TAGCACGAGT GTCACCATTT ACTTGTGCAG TATCAACTTT TTGACTTACT 108 0 

25 TGAGCATTGC CTGTTTTGTT ATTTGCTGTT TTTGGTTGGA CAATAGCAGG GTCTTGATAT 114 0 

ACTTGAGTGC CAGAAATGTT TTGCGTTGGA TTTTTTACCT CAGCTTTTGC TTGTTCAGTA 120 0 

GTTGCTTTAA CTTTATTACT ATCTAAAACG TTTTTATTAG TAGTTTGATC TTGTGTCGTC 1260 

TCAGCTGCTT GAACTTGATG TGCAGTGACT GCTGAACCTA CAAGCGTTAA TGCAACCATT 132 0 

GAgGTAGTTT GTAATTGAAT TTTTTCGCCA TTCTATTTAT TACTCCyAAC ATTTATTAAT 13 80 

TATtACTAAC ATTATAGTAC CTGTnTTATA TACCTGTGCG TA 14 22 

(2) INFORMATION FOR SEQ ID NO: 306:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6076 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 6 0 

AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 120 

CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 180

GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 24 0 
GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAGTAAT 360

TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT ATCACGGAGA CTTTGCTATT 420 

TACTATCCTG TAGAATCTGC TCTAATAATA GAAACAGATA CTCCTAGGCT AATTAATCAT 4 80 

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 54 0 

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT 600 

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACACGGAA AATTGCCTTA TTGATGAATT 660 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCCGTCCTT ATTGTAATTT TCCAACTACT 72 0 

CAAATTAAAA ACCACCCGTG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 780

TTGTGCTTGG TGGTTTTGAT GTAATGTCAT AGACTACGCG GTTGACGTGA TCTACTTCGT 84 0 

TTACGATACG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGCGTGCG AAGTCACTTG 90 0 

20 T CAT ACCGTC GATAGATGTT ACTGCACGAA TACCTACTGT GTGATCATAC GTACGGTAGT 96 0 

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020 

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 1080

25 CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAAT AC C AAGACCAGGT CCTGGGAATG 114 0 

GTTGTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 12 0 0 

CTTTAAACAA TGTATTGATT GGTT CGATT A ATTCGAATTC CATGTCTTCT GGTAATCCAC 12 60 

CAACATTGTG GTGTGATTTG ATTGTTTGTG CTGTTTTAGT ACCTGATTCG ATGACGTCTG 13 20 

TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA 1380 

35 ATACGTATAC AAATTCATTA CCAATGATTT TACGTTTTTG TTCAGGATCT GAAACACCTT 144 0 

TTAATTTATT CATAAAG CG A TCTTTCGCAT TAA CACGAAT AATATTCATG TTGAAACCTT 1500 

CACCGAATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGtAAGCCA TGGTCTACAA 1560 

40 AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620 

CTACGCCGCC ACTCATCGCA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 1680 

CAATTTCGAT TTCGATAAAG TTTTCCATTG TCCATTGACC TCTACAATCA CAAACACGAC 1740 

45 GGACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800 

GAACACCATA AATGCGACGT TTCTTATCTT CGATTGCTGC ATAGTCTGTG CTTGGGCTAT 1860 

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA CTC ATC C AAA 1920 

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

TACCGTATTC ACGTTCATTG GCACGTTCAA CTTTACCACC TAATAATTTA GTAGTTAATT 204 0 

TAAATGAACC TTCTTCATAA ACTGAATTTG GACCACCTGA TAAGATAATA CCTTTTGGAT 2160

TCATTTTCTT AATTTCTTCA ATTGAAATTT CATGATCGTG TAATTCACTA TAAACGCCCA 2220 

TTTCACGAAT TCGGCGTGTA ATTAATTGGT TGTATTGGCT ACCAAAGTCT AAGACAAGGA 2280 

TTAACTCTTG TTCTTTTGCC ATTTCCATAT TTGTCGTTCT CCTTTATCTT AATTAGAATG 2 34 0 

AGTAGTTCGG TGATTCTTTC GTAATTTGAA TATTATGTGG ATGGCTTTCT GCTAAACCAG 24 00 

CAGGACCCAT ACGTGTAAAT TGTGCTTCTT CGCGTAATTC TCTTAAATCG TGTGAACCAG 24 60 

TATAACCCAT ACCAGCACGC ACACCGCCCA TTAATTGGTA AATTGTATCT TGTAACGCAC 2 520 

CTTTATAAGC CGTACGTCCT TCGATACCTT CAGGAACAAA TTTCTTAGGC GCTTTGTCCT 2580

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT CCATCGCACC TAAAGAGCCC ATACCACGGT 2 64 0 

ATACTTTATA TTGTCTACCT TGGAAAATTT CTGTTGCGCC TGGGCTTTCT TCAGTACCTG 2700 

20 CTAATAAGCT AC CTAACAT A ACCGCATGTC CACCAGCAGC TAATGCTTTA ATGATATCTC 276 0 

CTGAGAATTT AATACCACCA TCAGCAATGA TAGCTTTACC ATGTTTGCGT GCTTCAGTtG 2820 

CACAATCATA AATTGCTGTA ATTTGTGGTA CACCAACACC TGCTACAACA CGCGTCGTAC 288 0 

25 AAATTGAACC TGGGCCAATA CCAACTTTAA CAATATCTGC ACCCGCTTCA AATAAATCTT 2 94 0 

TTGTTGCTTC TGCAGTTGCT ACGTTACCTG CTACTAATGT GATTTCTGGG TAAGTCTTCT 3 0 00 

XAATATGTTT CACTTGATCG ATAACACCTT TAGAGTGACC ATGTGCTGTA TCGATAACTA 3060 

AGACATCCAC ACCTGCTTCG ACTAATTTTT GAGCACGAAT ATCAGTATCT TTTGAAATAC 3120 

CAATTGCTGC GGCTACAAGT AGACGACCAT GTTCATCTTT TG CTGCATT A GGGAATTCGA 318 0 

TAACTTTTTC AATATCTTTA ATAGTAATAA GACCTTCTAG ACGTCCGTCT TTAACTAATG 3 24 0 

GTAACTTTTC AATCTTATGT TTTTGGAGAA TTTTTTCTGC TTCTTCAAGT GTTGTATTCA 330 0 

CTGGAGCTGT AATTAAATTt TCTTGCGTCA TTACATCTAC AATTTTAATC GAGAAGTCTT 336 0 

40 CAATAAAACG TAAGTCACGG TTTGTTAAAA TACCTACTAA GTTGCGATCT TCTTTATTAT 342 0 

CAACAATTGG TACACCTGAA ATACGGTATT TACCCATTAA TGCTTCTGCT TCATAAACGC 34 8 0 

TTTCTTCTGG CGTTAAGAAA AATGGGTTTG AAATGACACC ATTTTCTGAG CGTTTTACTT 3540 

45 TTTGAACTTC GTCCGCTTGT TCTTCAACGC CCATATTTTT ATGAATAACA CCTAAACCAC 3600 

CTTGACGAGC CATAGCAATC GCCATTTTAG ATTCAGTTAC AGTATCCATA CCAGCAGAAA 366 0 

TAACTGGAAT ATTTAATTTA ACTTTGTCTG ATAATTGTAC GCTTAAATCA ACGTCTTTCG 3720 

GTAAAATATC AGATTGTGCT GG AATTAATA ACA CAT CATC AAACGTTAAT GATTCTTTTG 3780 

CAAATTTACT TTCCCACATT AAAAACAGCC TCCATTTTTC AAATTAATTA GTTATATTAT 3 840 
GCAGAGATTG CGCCTAAAAC AATTCCGTTT TGAGTCAACC ATGCAAATTG TTCACCTAAA 3960

CCTTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA At CCTACTGA TACAGCGATA 4 02 0 

ATTAATAAAT TGTTTTGATT TTTAAAATCG AT ATGTC C T A ATATACTAAC AC CAT ATGCC 4080 

ATTACCATGC CAAACATAGC TATCATCGCA CCGCCTAACA CAGGTAGCGG TATGATATTT 414 0 

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 4 200 

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC CTACATTTTG CGAATAGGCT 4 260 

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCG CTAGACCTTC CGCAGTATAA 4 3 20 

l5 CCTTTACGAA AATCT1TTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4 3 80 

ACACCTGTCG ACTCAATTAA ACTAACGATA GCTACAATAA AGAACACTAA CGTCGATGTC 4 44 0 

ACATCAAAGC TAAATCCAGA GAATCTAAAC GG CACTGGG A TGCCTAACCA ACCGGCATGA 4 5 00 

20 TTGACTTGAT TAATATCGAC CAT CCC AAGT AAGCCAGCAC CTATCGTTCC TAAAACGAGT 4 56 0 

CCAATTAATA TGGCAATACT CTTAATAAAT CCAGTTGTGA ATCTTTGTAA AAGAAGAATA 4620 

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4680 

TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 4 74 0 

ACACTACCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4 800 

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4 86 0 

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4 92 0 

CCTGTTCCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4 980 

ACTGTAACTA AGTAAGCGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 5040

GG AACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 510 0 

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTGAAGCA ACCTTGCAGA 516 0 

40 GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGG AAACTCTTTT 5220 

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCGCTTG CTGTGCGATA TCGTATAATC 5280 

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCTTCTTCTG 5340 

45 ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 54 00 

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TTTTTTCGCA AATAAACATG 54 60 

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGG AAGCT TCAATGGTTA 5 520 

AGATTTTAGT AATCCCTTTA TCTTTAAATT GCTCGTAAAA AGTGCGACCA ACTTCATTCA 558 0 

TTAACTTTGC ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 564 0 
ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCCAAACTAA 5760

ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATGCAATCAA 5 820 

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 5 8 80 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5 94 0 

GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 60 00 

ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 6 060 

ATCTTTTAAT AGTTTT 6076 
(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

ACCGCCGATT GATAGCTTTA CTGCTGCGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 60

TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 120

AGATCCAATA TCGTAAAAAC ACAATGCATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180

GATATCAAAT GAAGTTCCAA TTGAACCATC TGGTAATCCC CCGATTGGTA TAATATCAAC 240

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 300

AACAAGTATA ATTTTAGGCA TTATCTTCAT CTCCAATCAA AGCGTTTAAA ATATAAACCA 360

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAGCTCTAC 420

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA TTGAATGTTA 480

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTCGCGCTAC TACATCATAC ATTGTCTTTT 540

CATTTAAAGT AACTTTACCA CGTGATTCAA CCGCTTCGGC AAATGCCTGA ATTAGTGTAA 600

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGCAGA CATTTTAACA AAGCTAAAGC 660

CATACAGTGG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720

ACAATGATTG CATTGAGCTA TCATCAAGTT TGTCTTTAAG ACTACTAAAC CCACGAACCA 780

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 840

GTTTTTTAAA CGTTTCTTCT AAATTTAATA AACGTGCTTT CATATCATTC ACTTTCATTT 900

GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 960
ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 1080

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 114 0 

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 12 0 0 

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 126 0 

TTCACCATGT ATACCAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 132 0 

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 13 8 0 

CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 1440 

/5 GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 1500 

AGCAATGTCG TCACGAACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 1560 

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620 

20 ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 16 80 

TACTTCGCCA CAAACCGCTG CAT CT AG CAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 1740 

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800 

25 TACAATAACT GT ATT AG CAA TCAGATCTAA CTCTGGGTGC GCAATTAACA ATCCTTCAAG 1860 

CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920 

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 1980 

CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 204 0 

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100 

35 AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 2160 

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 2220 

CCACCTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 22 80 

40 ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 2340 

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA cTGCTACAGC AGCAGATGAT 24 00 

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 24 60 

45 CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2520 

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 2 580 

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 2 64 0 

ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA CACCTATTAA TACCATACTC 2700 

GCAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 2760 

CCAATTGCAA TAAATAATGC GATTGCAAAT CCGCCACtGC GTTAATACCT AATATACCTG 2880

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTATTGC ACCACTAATA CTAAgCtGCG 2 94 0 

CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 3 0 00 

ATGGCATCAC CTTGTCCAAT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 3 060 

CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 3120 

ATCAATTGTC TGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 3180

AATATAAAAA GTAAGGAACA CCGATAAATG AAA TG ATTG C ACCAACAGGC GCTTCTCCTA 3 24 0 

AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 3300

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3 3 60 

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 34 20 

TTAAACATAT GCCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGCT AATGATTCAC 3480

CAAGATTTAA AATGGTAAGT TGTTTACTAA TTGTTAATAT AATGAATAAC GCAATACCAA 3 54 0 

TTAATGGAAT TGCCCACTTA AGGTGTGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 3 6 00 

25 AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3 6 60 

ACATAGCACT TACTGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3 72 0 

TAGATCGGCC TATCATTAAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATC CAGCAA 3 7 80 

ACATCAATAT TAAAAATGAA GTGTTTGGTA AAACTGCATA TGTTAATGCT AAAGCAAATG 3 84 0 

AAGCACCTGA ATTTAAACCT ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACACCTT 3 90 0 

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTTCTGG 3 96 0 

GAATCCTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA TTAAAAATAG 4 02 0 

CCTCTATAAT TGTAGAGGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAAATAGTG 4080

40 TGATTAGTAG TATCATACTT AAACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 414 0 

TCTCTCTATT TGTCATGTTA ATTGTCCTTT TTATCATATT AACTTACTTA ATTAAGAATA 42 00 

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4260 

45 TAAAACAACA TCAATATTAA AAACTTTTTC TAATATTTCC TGTGTTAATA CGTCTTCTGT 4320 

TGAACCTGTA GCGATGATAT CCCCTTCTTT CATCGCAATA AGATGATCTG AGAAACGAAT 4380

CGCTTGGTTG ATATCATGAA GAACCATGAC AATTGTACAA CCTTGTTCCT GATTTAGCTT 444 0 

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATATCT AAATATGTTG TTGGTTCGTC 4 50 0 

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA ATCCAAACAC GTTGTCTTTG 4 560 

TGCCCAATCA ATtTCTTTCT TAt CCTCAGC AgTTAATCTA CCAAATCCTT TTTGATGTGG 4 6 80 

AAAACGACCA TATGAAACTA ATTCCCCAAC AGTTAAGCCA TCTGCTACTT CaGGTGaTTG 4 74 0 

aGGTAAAaTG GCtATTTTTt TGcAATCyCy TTCGTAGATt GTGtATGAAT ATTTTCACcA 4 800 

TCTAAAAATA CTTCGCCTTC TTTAACTGCC AATAAACGTG ACAATGCCTT TAG CAAAGTA 4 860 

GATTTCCCGC AGCCGTTAGG ACCAATGATT GACGTCACTT TGCCATCTGG TATTTCAACA 4 920 

TCTAATTTAT TTATAATCGT GTTATCCCCG TAACCAATTT TAACTTGTTG TCCATGCAAA 4 98 0 

CGATTCATAA TTTCCCTACT TTCAATAAAA TTCTTTCTGT TTATAAAAAA TAATTTCTAT 504 0 

TTTTAAATTA TCAATTTTCA AAGACATCCC AATTGATAAT GATTATCATG AACATCATTA 5100

TAACATTTTT CAATCTTATT GACTAACATT ACTTTTTAAA TTGGATAGCT CGATTTGTCA 516 0 

TGTCTTGTAT ATTACTTTTA TAAAATAAAA AACGCCCACA GATAAGTCTT CATAGTTCAA 52 2 0 

20 AAACTTGTCC GTGGACTTCT ATTTAAGTAT GTGTGCTCAT ACCATTTATT TATTCATCTG 52 80 

CAAGAAAGCC ATTACCATAG ACATCTCTTA CATCATGAAT TACGAGGAAT G CAT CTTTAT 534 0 

CGATTTGTTT AATTAATCGC TTTGCTTTTG AAACTTGTGT TTTAGAAATA ACAACGTATA 54 00 

25 AGACATCTTT TTCTTCACGC GTATAATAGC CATGTCCGTT TAAAATGGTT AAACCTCTTC 54 60 

CAATTTGCTC GTCTATTGCT TTGGCAAGTT TGTCGGGATT AGTTGAAATA ATCGTCATAG 552 0 

CTTTTTTAGT GTTTAAACCT TCTATGACAT ATTCCATCAC TTTTGTTCCT ATATAAAGTG 5580 

ATATTACTGT TACTAATACT TTATCAAGTG GAATAACTGT AAGTGAAATT GCAACAACGA 564 0 

TCATATCGAA GAAAAGCAAA GCATATGGCG TGCTTACATC GAGGTATTTC GTTGCAATTC 5700 

TCGCCAAAAT TGTTGTACCT GCTGTTGTAC CGCCTGCAAG GATAATTACT CCGATTCCTA 5760 

GTCCAACGCT TACACCACCA AAAATGGCAT TCACAATGCT GTTTC CAGTT TCTACTTGCC 5 820 

ATGATTCTGT TAAACTCAAA AATATTGAAA TAAGAATTGT TACAAGAATA GTTAAGTACA 5 880 

40 TACTTCTCTT ACTCAAAAAt TTATAAcCTA TGGCAATCAA TACTGCGTTG ACCAAGAAGT 594 0 

TAGTGATGGC TGGTGAAATA TGAAACGCAT AATATAAAAT AATTGCTAAA CCTGTAACCC 6 000 

CGCCTTCACC TAAGTTACCA GAAAT^ATAA ATGCATTTAC ACCTGCAGCA AAGATAAATG 6060 

45 AACCTAAGAC AACTAGTATT AAATCTTTAA CCGTTTTATT CACGAAACCA TCCCCTTTAT 612 0 

ATATTTATTA GACTAT 613 6 
(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:

GATATCGTAS CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 60 

TTCTATG CAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 120 

TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 180 

TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 24 0 

GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 3 00 

ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGTAAGTATT 3 60 

TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 420 

GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 4 80 

AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 54 0 

ACAATGACAT TCCAATTGTT AATCGTTCCA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 6 00 

GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 66 0 

CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 72 0 

TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 780 

ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 84 0 

CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 900 

GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 96 0 

GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 102 0 

AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 10 80 

TCAAC CGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGG A CTGTT A ATGATTATTC 114 0 

40 CTG CAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATT ATT AG TT GCAACCGGAA 120 0 

CACTAGCCCA TATTTATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA 1260 

TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 1320 

TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 1380 

TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 144 0 

TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 1500 

TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 1560 

CAATTCAAAA TGATTTTAAA C T CAT AT CT A AGTTGTTACA CACTTATAAT GATAAATCTG 1620 

TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT CCAAAAATTA TTATTATTAC 1740

ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT GAACGCTAAA CAAATTCGAC 1800 

CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT TGCACAACTA GATAATAGTG 1860 

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC AGAGGAATTT CAAAATTTAT 192 0 

TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGCAAA TGAAGCGCGT ATTGCGTATG 1980 

AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA TCaAAATTTA A C a TTAG A CT 2 04 0 

CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT GATGATAGCG ATATTTATTG 2100 

CGTTAATGTT TGATTTTGAA AAAGCATACT GGATACCGTT ATCTGCACAT ACAATATTAC 2160 

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC ACGAGGTTTA GGTACTATTT 2220 

TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT ACCAACACCT GTTGCAGTAA 2280 

TTTTAATGGG CATTGCAGCA TTGTTTACTG AAGCATTGGT GGGAGCAAAT TATGCGATTG 2 3 40 

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG ATT AG CATC A CAGAATTTAA 24 00 

CAATTAACAT 7GCGTTTCCA AGAGTTATTG ACGTTGCAAT GGGTATTGTG ATTGCAATCA 24 60 

TAGGTTTATT TGTCCTTGGA CAACGTACCG CATCCGCATT GCTTCCTAAT GTAATGGCTG 2520 

AAGTTGTTCG TAAAGAAGCA ACGCTCTTTC ATTATTTATT TTCTGAAAAT CAATAT 2 57 6 
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:

CAAGCACATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60 

TTTCATACCC AAAGTAATGC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120 

TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGATGTCT AAA T CAT ATT GAATTTCAAG 180 

TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 24 0 

GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA TCAATTGTTC 300 

ATATTCTAAT TTTTCTTTAT ATGATAAACC ATTTTTCTTA CGTACAGTTG TAGAAGATTT 360 

AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATGAT TTTTTATAAC TTTCATAATC 420 

TTCAAAAGTT CCGATAATCT TTTCCATCTG ACCATCATGA ATAAACCAAT ATGACTGTGC 4 80 
AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 600

CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 660 

ACCAGATA 66 8 
(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

CAT CAGTTT A TTTTGAAAGG CAATGCGATC ATTTCATGTA TTTATGTTGT TTGAAACATC 60 

GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG ATGGAAAAAT ATTCGATAAT 120 

CAAATAAACA ATGAATTTTA GAAGGTACAA TGACGTTCAC TAATTTAAAT ATAGCTGAAT 180 

GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 240 

TTTTTATTAA GCGTTAGGTT GAATG TATG A GAATTTTAGA TTTATAATAG AAGATAGAAA 3 00 

CGAAAATTTT TCTTAAAAGC AGTAATGTTG ACT CAAAAT A AGCTATAATA ATGACACTTA 360 

TTTAATTGAT TAACATTTGC TAATAAATAT CAATATAGAA TATAACTTTC CAATAATGAC 420 

TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 4 80 

ATTTAGGGTT GAACTATACA TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 540 

GCTTTAAAGT GGAATATACT TGAAATAATT AGTAGAGGTG AGTAAGGATG AGTAATAAAT 600 

TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTTAGAC TTATTATCTA 660 

AACqTGGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 720 

ATGATCCACA ACGTGAAAAA GAAATGCTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 780 

TCAACGATAA TACTATTAAG CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 84 0 

AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900 

TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960 

CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 1020 

GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAG 1080 

GCCTAGGTGT TGAAGGACTT AAAATACTTA AACAGATTAA AGATAAATAT GATTTAAATG 114 0 

TTGTCAGCGA AATCGTAAAT CCAAATGATT TTGAAGTGGC TGATGAGTAT TTAGACGTAT 1200 
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AAAAGCCTAT TCTATTAAAA CGTGGTTTAT 
CTGAATACAT TGCTTCACAA GGTAATCAAA 
CTTATGAAAA GGCGACACGT AACACTTTAG

GTACACACTT ACCAGTCATG GTAGATGTTA 
TACCAACTGC GAArsAgCAT TAGCAGTTGG 


AGATCCATCT GTTGCACTTA GTGATGCGGG 
ATTTTATGAT GAATTAAAGC CTTTAGCTGA 
TTCCAAGGAA ACTATAGACT ACTTAACTAA 


GAATCGACAA TTTTAAACTT ACAGCCATTC 
ATTTTTTGTA TAGAAATAAA GGTATACTGC 

TGTTCAATTT CACAATTATT AAAGATTATG

GAATTATCTG TAAAATATAA TAGATTCACA 
ACAAATGCAA TTGATATTTG AAAAGGCTTT 

GTTTATATAA AATTAAAATA TATCGTTCGT

TTTTCGCTAA ATTTTTCCAA ACAAAAATAT 
GAAAATGAAA TGAAAACATG TTATTATAAT 


ACAGTTACTA TATATGATGT AGCAAGAGAA 
GTTGTTAATG GGAACCAAAA TGTTAAAGCA 
AAGCGTTTGA ATTATCGTCC AAATGCTGTT 


ACAGTAGGTG TGATCATTCC AGATATATCT 
CTTGAAGATA TTGCAACAAT GTATAAATAT 

CCTGAAAAGG AAAAAGAAAT TTTTAATAAC

TTCCTTGGTG GTACAATTAC TGAAGAAATG 
GTAGTAGTAT CAGGAACAAA TGGTAAGGAT 

ACTGAAGCTG CGAAAGAAAT TACGGGAGAA
TTAGTAGGTG GAGAACATTC TAAAAAAGCT 
GTGTTAAATA AAAATGGCCT TCAATTAGGT 


TATAAAGAAG GCGTAAAAGC TTTTGCCAAA 
TGTATCAGCG ACGAAGAAGC AATTGGTATT 




CTGCTACAAT CGAAGAGTTT GTTTATGCAG 13 2 0 

ACATTATTTT ATGTGAACGT GGAATCCGAA 138 0 

ATATTTCAGC AGTACCAATT TTAAAACAAG 144 0 

CGCATAGTAC AGGTCGTAAA GATATCATGT 1500 

TGCTGATGGA GTTATGGCTG AGGTGCATCC 156 0 

TCAACAAATG GATTTAGATG AATTCCAAGC 162 0 

TTTATATAAC GCTAAAAAGT TAAAATAATA 16 80 

TATGTCATGT TGAAGTAGAA TATTATCTTT 174 0 

TAAGAGTATA TTACTTTTAG AGTGGCTATT 18 00 

ACTTAACGAT TGTTATAATA CTTCGACACT 1860 

ACTGATAGCA GTAATTAAAA TTATAACTAT 1920 

CATTTGTTGC TGAAATGTGA ACATTTTTCA 19 80 

CTCAAAACAT TACAATTAAA AATGAAAAAA 204 0 

TATCATTTAG CGTTTGTTTT TATTTCAAGC 2100 

GTTACTGTAA ATTAAAATAT GGTAAACTAT 216 0 

GAATAAAACG TTTACAAGGA GGAAATTATG 2220 

GCGCGTGTCT CTATGGCCAC AGTGTCGCGT 22 80 

GAAACTAAAA ATAAAGTTAA CGAAGTCATT 2 34 0 

GCTAGAGGTT TAGCTAGTAA AAAGACAACA 24 00 

AATATCTATT ATTCACAACT TGCTCGTGGA 24 6 0 

CACTCAATTA TTTCAAATTC AGATAACGAT 2 520 

TTATTAAGTA AACAGGTTGA TGGTATTATT 2 58 0 

AAAGAATTGA TAAATCAATC ATCTGTACCT 264 0 

GCACATATAG CATCAGTTAA TATTGATTTT 2700 

TTAATTGAAA AAGGCGCTAA ATCATTTGCT 276 0 

CAAGAAGATG TTTTAGAAGG TTTAACTGAA 2 82 0 

GATACATTGA ATTGTTCTGG TGCTGAAAGT 2880 

ATGAAAGGCA ATTTGCCAGA TGCCATTTTA 294 0 

ATGCATAGTG CAATGGATGC TGGTATTAAA 300 0 
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CCACAACTTT CTAGTGTTAT TCAACCATTA 
TTAACAAAAT ATATGAACGA TGAAAAGATA 


ATTGAATACC GAGGAACTAC AAAATAAATT 
AATTATATGG GTGTTGGAAG AATGCCTTTT 
TTACTTATTG ACGAGTAGAT TCGTACCAAC 


TTTCACTAAT TTCTTTTGTG CGCGGGACAT 
TTTTAGGTAA TAATTCAGAT GAATAATGTT 

AACCACTTTG AATTGGTTGA TCAATTAAAC

CGACTCTCCA AATATTGTAG CCTCCGCCAC 
AATCAGCTAA ATATTTTACA AAATATGGAA 

GATGAGTTAG TGGATCACGA TAATGTATAT

GTTTAAAACT CTTTACGACA GGCTCAACTG 
CTTCTGTATA CGGTTCAAGT GGGACATTTA 
GCTCAGTATA GTGACCAGAG C CTGGG AAAA 
TAGTAACATG GTTATCGGCA TAGAAACTCC 
TATCTATGAT TAAAACGCGT TGATTGTATT 


TATCATTGTA TATACAAAAA CCACTTGCTC 
CTAAGTGACA ACCATTTAAT ACTTTGCCTG 
CTCCAACAAT TGTGGCACTA TGGCGGTGCA 


TTAATCCATA TTTCTTAGCC TCATCTTCAC 
CTTCGACGTA ATCATATTTA TGAATTAACA 

GTACTATTTG TTCTGGAGAC AATAAATTTG

AACGCATTTG ATTGAAGGGA TGTTGGTCAT 
AAACATATGC AGTTTTTGAT GAATGTTGTT 

AAAAGAAAAA CCGATTCATA TAACGAATAT

TGTTTTTGCC AATTCTTGCC ATTAAACAAT 
CTGTAGCGAA TATTTCAAGT CCACCAGTTG 


CAAATACATC TAACTTTGAA TTTTTTAAAT 
CGATATAATT CTCGAATTCT GGTGTAGAAA 




TATGATATCG GTGCAGTAGG GATGCGCTTA 3120 

GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180

CACAAAATTA GGC ATT CATC TAACGACCCA 324 0 

ATTTATCTTT TAAAATCGTT GCAGATTAGG 3 3 00 

TCGCTATATG TAAAGCTAAT TTTTTATTTT 33 60 

AGGTATAATC ATTTAAACGA TCTTCCCATC 34 2 0 

TCCATTTATT AATCCATTCT AACGGTAAAT 34 80 

TTAAGAATAC ATGACTCCAT GCACGTGGTA 354 0 

CAAACATAAT TACCTTTCCA TTCGTATAAG 3600 

TTTCATATAA TGAATGTAAC GTACAATTTA 36 60 

CGACACCATT rACGCTTAGA ATAATATCAG 3 720 

TTAATTTAAA ACACTCCAAA AATGATGCAT 3780

CAGTGTGTCC ATAGCCGATA TCTTCACCGC 3 84 0 

GAAATTTTCC GGTTTCATGG ATAGAATAAG 3 900 

ATTGTGT AC C ATCTCCATGA TGTGCATCGG 3 96 0 

CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4 02 0 

GACCAGGTTG AGCGTGATGC AAACCACCAC 4 0 80 

ACATAATAAG ATCTGCTAAA GTTAAAGCGC 414 0 

TATGCTTAAA TTGACCATTC TCTTCATCAT 42 00 

TGATAATGCC ATGTGAAGCA TGCTTAATAG 4260 

TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 43 20 

CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4 380 

GAAATCGATA TTGTAATAAC TTATCTGAAT 444 0 

GCATATAATC CCTCCGATAT TCCAAAAATT 4500 

CATCAAACGC TTGTTGCTGT TCTAATGTAA 4 560 

TAGCTGGATG ACTTGTTATT TCTGGATCAT 4620 

CCATTAACCG CTGCATTAAT TTTTTATAGT 4680 

CCCAATGCCA GTAATATTCT GTAGTTATAA 4 740 

GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4 800 
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TACCTGTAGA CCAGCGCTCA ATTTCATCAG GATAGTGGAA AGTGACATAA CCCACAATGA 4 92 0 

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4 980 


CTTTATACTG ATCTTCAGCA TCTCTAAATG CGGTTAAATG CGCATCGAAA GTAAGCGCTT 504 0 

TCAAATCTtC GTGTGTTAAA GG AC CTTCAA TAACAAATTG CTTGTCATGA ATGTAATAAT 5100 

CTTCGGATTG ATACGTCTTT AAATGATTCA TATTTTCAAC TCCTCAATCG ACGTCGTGAT 5160 


TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT ACTAGAAAAA GGATAAAAGT 5220 

AAAAATTTTG AATAATTAGA AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTTCACAT 52 80 

AACAAAGGGG GAGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAACATAA 5340

CCTTAAAGAT TATGAAGAAA CATATAATAC TTTTGATTGG AAAGACGTAG AACAAGCATT 54 00 

TTCTTGGAGT GAAACTGGAA AAATGAACAT GGCATATGAA TGCATAGATC GCCATGTAGA 54 6 0 

TCAAGGATTA GGGGATAAAA TAGCGTTAAA TTACAAAGAT GAGCACAGAA AAGAATCGTA 5520

TACTTATAAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 5580

AGAAGTTGAC AAAGGTGACA GAGTATTTAT ATTTATGTCG CGTACACCTG AACTATATTT 564 0 

TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATTTG AAGCATTTAT 5700

GGAAAAGGCA GTTGCGGATA GATTAGAGAA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 576 0 

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 5820 


AGATGAGGAT GTAGAAGACA ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCGA 58 8 0 

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 594 0 

TTCTACTGGG CAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TGCACTATAT 6000 


TTCTGGAAAA TATGTATTAG ATTTACAAGA AGATGATGTT TATTGGTGTA CAGCAGATCC 6060 

AGGTTGGGTT aCAGGAACAT CTTATGGTAT TTTTGCACCA TGGTTAAATG GCGCTACAAA 612 0 

TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACAGTGGTAT AGTATGATTG AAGATTTTAA 6180

AGTGACGATT TGGTATACGG CACCAACAGC TTTAAGAATG TTAATGAGTG CTGGTGACGA 6240 

TATTGTTGAG AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 63 00 

AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAACGGTGT TAGATACTTG 6360

GTGGATGACA GAAACAGGTG GACATATGAT TGTTAACTAT CCAACGATGG ACGTcAAGCT 64 20 

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG ATGCAGGGAA 64 80 


TGAATTACCA CCAAATCGAA TGGGCAACCT TGCTATAAAA AAAGGCTGGC CATCAATGAT 654 0 

GTATCGTATC TGGAAGAATC CAGAAAAATA T AAAT CAT AT TTTATTGGAG ACTGGTATGT 6 600 

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TGATGTAATT ATGACAGCTG GTGAACGAGT 
TGAACACGAA GCAGTTGCCG AAGCAGGAAT 


AATAATTAAG GCGTTTGTTG CACTGAGAAA 
AGAAATTCGT ATATTTGTTA AAGAAGGTTT 
ATTTAAAGAT AAATTACCTA AAACACGGTC 


TTGGGAATTA AATTTAGATG CTGGGGATTT 
TTGAAGATTT TTTTCGAAGA ATAAAGGGTG 

TCGTTTACTT TACGATAAGC AATATAAAGA
AAAGCATTTG AAAATGATGA CATAACAATA 
TAATATGATC TTTGGAAGAT GATTATTTTA 

TCAAGCGCTA AAAGTTGTAT AATACAAAAC
AGCGCTAACC CGATTTAGTC GACAAGTTTT 
AAAATTTCTT AGCTACAACT TACATATTAT 
AAAAGTTGAC TCATTTATCA GATTTAGATA 
AGGATATTGC TGCATCAGTA GGTATTTCAG 
AAGCTAAAAT CGACATTAAT AAAATTACGC 


TAACTGCGAT GAGCCCAACA CCAGCTGGTG 
CTGATGCATT CCATGAGTTA AATAAAAACG 

GACCAACATT TGGTATCAAA GGTGGTGCGA
TGGAAGATAT CAACTTACAT TTCAACGGAG 
CATX^TCTGC GTTTATCGAT AATCATATTC 

GACGTATTGA GTGGAAACGT GTATTAGATA
TTGGGTTAGG TGGACCTACA AATGGTGTAC 
CGTCTGAAAT TATGGCGATT TTATGTTTAA 


TTAGTCGTAT TACTATTGGT TACACTAGAG 
AAGTGGAAGG TGCACTTGCA ATGATTTTAA 
CAATTGAAGG GACACCTGCA TTAGTTCATG 


GTAACTCAAT TTTAGCAACT GAAACAGCAC 
CTGGATTTGG TTCAGACTTA GGCGCTGAAA 




TGGACCATTT GAGGTTGAGT CTAAATTGGT 6 7 20 

TATTGGTAAA CCTGATCCGG TTCGCGGTGA 6780

AGGATATGAA CCAACAGACG AATTAAAAGA 6840

GTCGGCACAT GCAGCACCAC GTGAAATCGA 6 900 

AGGTAAAATT ATGAGACGTG TATTAAAAGC 6 960 

AAGTACAATG GAATAATGAC ATGAATGTTA 7 020 

ACAACATATT TCATGTCAAT GTTTAAATAA 708 0 

ACTGTTAACT TGTGTCATAT CATTTCGTAG 714 0 

ATGGCATATC TTTATATTGC TTTTTATTTT 72 00 

AATAATAGAA AAATATAGTT ATCAATAGTA 7260

TTTAATAAGT GAATTTATTG CAAAAATGAA 73 20 

TAACAGTTCG TTATTATATG AATGTAAGTA 7380

AAATGCATAA ATTAAACAAA AAGGGGCGAA 744 0 

TTGCGAATCA ATCAACACTA CAACCAATTA 7500

AGGATGCATT AGAACCTTAT GGTCATTACA 756 0 

CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 76 20 

AAGGTAAATC AACGGTTACA GTTGGTTTAG 7680

TTArGGTTGC ATTAAGAGAG CCTGCTTTAG 774 0 

CTGGTGGTGG TTATGCGCAA GTCTTACCTA 7 800 

ATTTCCATGC GATTACAACT GCAAATAATG 7860

ACCAAGGTAA CGAATTAGGA ATCGATCAAA 7 920 

TGAATGATCG TGCACTTAGA CATGTAAACG 7980

CACGTGAAGA TGGCTTTAAT ATTACAGTAG 8 04 0 

GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100 

ATCGCAAGCC AGTTACAGTT GCAGATTTAA 8160 

AAGATGCAAT AAAACCAAAC TTAGTACAAT 822 0 

GTGGACCATT TGCGAATATC GCACACGGTT 8280

GTGATTTAGC TGATATCGTT GTAACGGAAG 8340

AATTCATGGA CATTAAAGCG CGTGAAGCAG 84 00 
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GTGTAGCGAA AGATAATTTA AAAGAAGAAA ATGTAGAAGC AGTAAAAGCA GGAATTGTTA 852 0 

ATTTAGAGCG TCATGTTAAT AATATTAAAA AATTCGGTGT AGAACCGGTT GTTGCAATTA 8580 


ATGCATTTAT ACATGATACC GATGCAGAAG TAG AAT AT G T AAAATCTTGG GCTAAAGAAA 864 0 

ATAACGTACG AATTGCCTTA ACTGAAGTTT GGGAAAAAGG TGGTAAAGGT GGCGTTGACT 870 0 

TAGCAAATGA AGTATTAGAA GTCATTGATC AACCTAATTC ATTTAAACCT TTATATGAAT 8760


TAGAATTACC ATTAGAGCAA AAGATTGAAA AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8 820 

AAGTAACGTT TAGCAGTAAA GCGCAAAAAC AATTAAAACA ATTTAAAGAA AATGGTTGGG 88 8 0 

ATAATTACCC AGTATGTATG GCGAAAACAC AATATTCATT CTCAGATGAT CAAACGTTGT 8940

TAGGTGCACC ATCAGGATTT GAAATTACAA TTCGTGAATT AGAAGCGAAA ACAGGTGCAG 900 0 

GATTTATCGT AGCGTTGACA GGTGCAATCA TGACTATGCC TGGTTTACCT AAAAAACCAG 90 60 

CAGCATTAAA CATGGATGTT ACTGATGATG GTCATGCAAT TGGGTTATTC TAATAAATCA 9120

TGTCAATTGT TTAATAAAGA TAAGTAAATA GTTTAATAGA CCGGACTGTT GGAGATGCAT 9180 

TATTTCAGCA GTTCGGTTTT TTGCTGTGCT AAAAATAGAT TCAATTTGGC GAATCTAACG 924 0 

ACAATGTTTG AAGGTGGTTA ATTAATGTAT ATGAAGATAA AAAGTGGGCT TGAAGAATAG 9300

GAAAGCGATG CAATGAATAT TCCATATTAA AAAAAATTAA TAAAATAGGT TGCAATATTT 93 6 0 

AATTGGGATG CGCTACAATT AA CACT AAT A ATTGATATTG ATAATTATTA TCAATTAAAT 94 2 0 


ATAATCTTAT AGGAGTTGTT AACAACATGA ACAAACATCA CCCAAAATTA AGGTCTTTCT 94 8 0 

ATTCTATTAG AAAATCAACT CTAGGCGTTG CATCGGTCAT TGTCAGTACA CTATTTTTAA 954 0 

TTACTTCTCA ACATCAAGCA CAAGCAGCAG AAAATACAAA TACTTCAGAT AAAATCTCGG 9600 


AAAATCAAAA TAATAATGCA ACTACAACTC AGCCACCTAA GGATACAAAT CAAACACAAC 9660 

CTGctACGCA ACCAGCAAAC ACTGCGAAAA ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

ATGCAATTAA AGATCCTGCA TTAGAAAATA AAGAACATGA TATAGGTCCA AGAGAACAAG 9780

TCAATTTCCA GTTATTAGAT AAAAACAATG AAACGCAGTA CTATCACTTT TTCAGCATCA 984 0 

AAGATCCAGC AGATGTGTAT TACACTAAAA AGAAAGCAGA AGTTGAATTA GACATCAATA 9900 

CTGCTTCAAC ATGGAAGAAG TTTGAAGTCT ATGAAAACAA TCAAAAATTG CCAGTGAGAC 9960

TTGTAT CAT A TAGTCCTGTA CCAGAAGACC ATGCCTATAT TCGATTCCCA GTTTCAGATG 10020 

GCACACAAGA ATTGAAAATT GTTTCTTCGA CTCAAATTGA TGATGGAGAA GAAACAAATT 10080 


ATGATTATAC TAAATTAGTA TTTGCTAAAC CTATTTATAA CGATCCTTCA CTTGTAAAAT 1014 0 

CAGATACAAA TGATGCAGTA GTAACGAATG ATCAATCAAG TTCAGTCGCA AGTAATCAAA 10200 
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AGGCAACGAC CAATATGAGT CAACCTGCAC AACCAAAATC GTCAACGAAT GCAGATCAAG 10320 

CGTCAAGCCA ACCAGCTCAT GAAACAAATT CTAATGGTAA TACTAACGAT AAAACGAATG 103 80 

AGTCAAGTAA TCAGTCGGAT GTTAATCAAC AGTATCCACC AGCAGATGAA TCACTACAAG 10440 

ATGCAATTAA AAACCCGGCT ATCATCGATA AaGAACATAC AGCTGATAAT TGGCGACCAA 10 500 

TTGATTTTCA AATGAAAAAT GATAAAGGTG AAAGACAGTT CTATCATTAT GCTAGTACTG 10 560 

TTGAACCAGC AACTGTCATT TTTACAAAAA CAGGACCAAT AATTGAATTA GGTTTAAAGA 10620 

CAGCTTCAAC ATGGAAGAAA TTTGAAGTTT ATGAAGGTGA CAAAAAGTTA CCAGTCGAAT 10680 

TAGTATCATA TGATTCTGAT AAAGATTATG CCTATATTCG TTTCCCAGTA TCTAATGGTA 10740 

CGAGAGAAGT TAAAATTGTG TCATCTATTG AATATGGTGA GAACATCCAT GAAGACTATG 108 00 

ATTATACGCT AATGGTCTTT GCACAGCCTA TTACTAATAA CCCAGACGAC TATGTGGATG 108 60 

AAGAAACATA CAATTTACAA AAATTATTAG CTCCGTATCA CAAAGCTAAA ACGTTAGAAA 10920

GACAAGTTTA TGAATTAGAA AAATTACAAG AG AAATT G C C AGAAAAATAT AAGGCGGAAT 109 80 

ATAAAAAGAA ATTAGATCAA ACTAGAGTAG AGTTAGCTGA TCAAGTTAAA TCAGCAGTGA 11040 

CGGAATTTGA AAATGTtACA CCTACAAATG ATCAATTAAC AGATTTACAA GAAGCGCATT 11100

TTGTTGTTTT TGAAAGTGAA GAAAATAGTG AGTCAGTTAT GGACGGCTTT GTTGAACATC 11160 

CATTCTATAC AGCAACTTTA AATGGTCAAA AATATGTAGT GATGAAAACA AAGGATGACA 11220 


GTTACTGGAA AGATTTAATT GTAGAAGGTA AACGTGTCAC TACTGTTTCT AAAGATCCTA 11280 

AAAATAATTC TAGAACGCTG ATTTTCCCAT ATATACCTGA CAAAGCAGTT TACAATGCGA 11340 

TTGTTAAAGT CGTTGTGGCA AACATTGGTT ATGAAGGTCA ATATCATGTC AGAATTATAA 11400 


AT CAGG AT AT CAATACAAAA GATGATGATA CATCACAAAA TAACACGAGT GAACCGCTAA 114 60 

ATGTACAAAC AGGACAAGAA GGTAAGGTTG CTGATACAGA TGTAGCTGAA AATAGCAGCA 11520 

CTGCAACAAA TCCTAAAGAT GCGTCTGATA AAGCAGATGT GATAGAACCA GAGTCTGACG 11580

TGGTTAAAGA TGCTGATAAT AATATTGATA AAGATGTGCA ACATGATGTT GATCATTTAT 1164 0 

CCGATATGTC GGATAATAAT CACTTCGATA AATATGATTT AAAAGAAATG GATACTCAAA 11700 

TTGCCAAAGA TACTGATAGA AATGTGGATA AAGATGCCGA TAATAGCGTT GGTATGTCAT 11760

CTAATGTCGA TACTGATAAA GACTCTAATA AAAATAAAGA CAAAGTCATA CAGCTGAATC 11820 

ATATTGCCGA TAAAAATAAT CATACTGGAA AAGCAGCAAA GCTTGACGTA GTGAAACAAA 118 80 


ATTATAATAA TACAGACAAA GTTACTGACA AAAAAACAAC TGAACATCTG CCGAGTGATA 11940 

TTCATAAAAC TGTAGATAAA ACAGTGAAAA CAAAAGAAAA AGCCGGCA CA C CATCGAAAG 12000 
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CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 1212 0 
AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 12173 


(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:

CAACATTAAT ATTGATATTA AATCTTCCTG GATAACGTGC TTGTTCGAGT GATAAGTATG 6 0 

CACGCACTTG ACTTAACTCT TTATCTAAAG TAATCGTATG TTGCTTAGAG CCTTGTAAAT 12 0 

TCGCTCTGAA AAAATAACTC AATTCTAATA GTAACTCTCG TGCCTTTTCG CTATTTATTC 180

TAACTAAAGC TGAGATCGTG TTAATTGAAT TGAAGAAAAA ATGTGGACTC ACTTGTGCCT 24 0 

GTAATGACTT AATCTCAGCA TCTTTCAATA ACTTACTTTG CGTTTCGGCT TCACCAAGTT 3 00 


CAATTTGGCT ACTAAAAATA TTTGCCAATC CTTCTGCAAG TTGACGTTCC ACAAAAGTTA 3 60 

AATCATTAGG GTTTGTAAAA TACATCTTCA ATGTACCGAC GATAGAACCA TGCATCTCAA 420 

GTGGTATCAC GATAGCTGCT CTAAGCGGGC AATTCGGATG ACTACAACCA ATCTCTTCTT 4 80 


TAGTATGAAC TTCTTTCAAC TTTCCTGATT TCAATACATC TTTAGACAGA CTTGTTAATA 54 0 

TTTCATTTGT TGGTATGTGA TGATCACTAC CTGCACCTAC ATGCGATAAG ATTTCATTTT 6 00 

TGCTTGTAAT TGCTACGGCA GATACTTTCA TTAAATTTTT AATAATCATC GCAATTTGCT 660 


GTGCCGATTC TCTATTCAAT CCTTCTTTAA AATACGGCAA TGTCTGGTTC ATCAATTGCA 72 0 

GTACATCATG TGTTTGAACA GCCTTCATTT GCTCCTCTTG CTTTAATGTT GAAATGATAA 780 

TAGACATAAA AATCGCCGTA CCAACGCTAT TAACAATAAT CATTGGTAGT GCAATTAATG 840

ATATGAGGTC AACCGCATAT GCTTTGTCGT GGGAAAATGT TAAAATGCTC AACATTTGAA 900 
TCATTTCCAT AACAATTCCA ATCATGGCAC TTTTCGCAAT ACTCGGGTAA CGCTTGCGTC 96 0 

TTTGAGCTTG TAAGCCAAAA TAACCAGCAA TTATACCAAT AAATATAGAT GAGATAAGAT 1020

AAACTTGTGC ATCCGCCCCA CCCATATACA CTCTGAAAAT ACCTGAAATA ACGCCAACAA 1080 

ATAGACCTAC AAAAGGGCCA CCAACTAATC CTGCGACACC TATCGTTAAT ACACGTGTGT 114 0 


TAGCTAAAGA TACATCATCA TCTAAACGGA AGTACACACT TCCTGACAAA CTATGTTGAT 1200 

GATCGATGAC GATACCAGTT AAATTAGACA TTAAGGCAAA CAAACTGAAA ATAATACATA 1260 
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7972 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 
TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 6 0 

TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 120

GTTATTTTTT AAAAATAAAT ATTGAATTnG GGGCTGnTTT CATGTCATTA AGAGATGAAG 180 
CATTGGAAAT GCACAAACGT AATCAAGGTA AATTAGAAGT TAAACCAAAT GTAAAAGTTA 24 0 

CTAATAAAGA GGAATTAAGT TTAGCATACT CACCTGGCGT TGCTGAACCG TGTAAAGATA 300

TTTATGAAGA TAAAAGAAAA GTATATGATT ACACAATTAA AGGAAATACA GTTG CAGTTA 3 60 

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG ACCTGAAGCA AGTATTCCTG 420 


TAATGGAAGG TAAAGCAGTA TTATTCAAAA GCTTCGCTGG TAT CAATGGG GTGCCTATTG 4 80 

CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT 54 0 

ATGGTGGTAT TAATTTAGAG GATATTTCGG CACCACGTtG TTTTGAAATT GAAGAACGAT 6 00 


TGAAAAAAGA AACTAATATT CCGGTATTCC ATGACGATCA ACATGGTACA GCAATTGTAA 660 
CATTGGCAGG TTTGGTAAAT GCATTGAGAG TTGTTAACAA AGATATTGCT AAAATAAAAG 720 

TTGTACTAAA TGGTGCTGGT GCAGCAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG 780

GTGTAAGAAA TATGGTTATG TGTGACTCAA GAGGCGCAAT TTTTGAAGGA CGTTCATATG 84 0 

GTATGAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT 900 

CTTTAGAAGA AGTCGTAAAA GACGCAGATG TATTTATCGG GGTTTCTGTA GCTAATGCGC 960
TGTCACAAGA TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 10 20 

CAAATCCTGA AATAATACCT GATGATGCCA AAGCGGCAGG TGCACGAGTT GTTGGTACAG 1080 


GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTCCCTGGT ATTTTTAGAG 114 0 

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 12 00 

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260 


TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAGCGGCA ATGGAATCTG 1320 
GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 13 80 
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ATTAAAATGA TGAAAATGAA AGTTTATGAT 
AAATAAATTT AAACCCGTTT TTAACTGGTC 


ACAGAGGTTA TTTGGAGGTT TTCCTATGTT 
GAAATATCTT ACAGTACAAG ACTCTAAAAA 
GTGTCCAAAG TGTAAGAAAA TTATGTACAC 


CTTTAATTGT GAT CAT CAT A TTGCTTTAAC 
TGAAGGATCA TTTACAGAAT TCGATAAGGG 

aAGTTATTTA GAAAAAATTG AAAAGGACCA
GACTGGTACA GCACAACTAG ATGGTATGAA 
TAGAATGGGA AGTATGGGAT CGGTTATCGG 

CACTGAGAAC CGTTTACCAT TTATTCTTTT
AGGTATTATT TCCTTGATGC AAATGGGTAA 
CGCTGGACTA TTATATATAT CATATTTAAC 


TTTTGCATCA GTTGGTGATA TAAATTTAAG 
TCGTCGAGTT ATTGAACAGA CAATAAACGA 
ATTTTTATTA GAGCATGGAC AATTGGATAA 


ATTGTCTGAA ATTCTAAAAA TCCATCAAGA 
CCACTTTTTG AAATTCGAAA TAAAATTGAA 

GTGGATTTAC AAGAAGAAAT TGACATGCTT
ATATATACAA AT CTAAAACC ATGGGATCGT 
ACGA^CCTAG ATTaTATTCC ATATATCTTT 

AATTTTAGAG ATGATCCAGC AATGATTGGT
ACAGTTaTTG GACAACAACG TGGAAAAGAT 
ATGGCGCATC CAGAAGGTTA TCGAAAAGCA 


AATCGTCCTA TCTTTACATT TATAGATACA 
GAACGTGGAC AAAGTGAATC TATCGCAACA 
CCAGTTATTG CGATTGTCAT TGGTGAAGGT 


GCCAATAAAG TATTGATGTT AGAGAATAGT 
GCGGCATTAT TATGGAAAGA CAGTAATTTG 




AAACATTCAA CAGTCAAACG AATATAAATC 15 0 0 

AAGTTCAGTT TAAGGCTCTA AATGGTTAGA 156 0 

TAAAGATTTT TTTAATCGAA CAAAGAAAAA 16 2 0 

TAATGATGTG CCTGCAGGTA TTATGACTAA 1680

AAAAGAATTA GCTGAAAATT TAAATGTGTG 174 0 

TGCGTATAAA CGTATAGAAG CAATTTCTGA 180 0 

AATGACCTCT GCGAATCCAT TAGATTTTCC 1860

ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920 

ATTTGGCGTT GCTGTCATGG ATTCACGTTT 1980 

TGAAAAGATA TGTCGCATCA TTGATTACTG 204 0 

CTCTGCAAGT GGTGGTGCAC GTATGCAAGA 210 0 

AACCAGTGTA TCTTTAAAAC GTCATTCTGA 216 0 

ACATCCAACT ACTGGTGGTG TATCTGCAAG 2220 

TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 2 28 0 

AAAATTGCCA GATGATTTCC AAACTGCAGA 2 34 0 

AGTTGTACAT CGTAATGATA TGCGTCAAAC 24 00 

GGTGACTAAA TAATGTTAGA TTTTGAAAAA 24 6 0 

TCTTTAAAAG AATCTCAAGA TAAAAATGAT 2 52 0 

GAAGCGTcAT TGGAACGAGA AACTAAAAAA 2 580 

GTGCAAATTG CGCGTTTGCA AGAAAGACCT 2640

GATTCGTTTA TGGAACTACA TGGTGATCGT 2700 

GGTATTGGCT TTTTAAATGG TCGTGCTGTT 2760 

ACAAAAGATA ATATTTATCG AAATTTTGGT 2820 

TTACGTTTAA TGAAACAAGC TGAAAAATTC 2880 

AAAGGTGCAT ATCCTGGTAA AGCTGCTGAA 2940 

AATTTGATTG AGATGGCTTC ATTAAAAGTA 3 00 0 

GGCAGTGGAG GTGCTCTAGG TATTGGTATT 3060 

ACTTACTCTG TTATATCTCC TGAAGGTGCA 3120 

GCTAAAATTG CAGCTGAAAC AATGAAAATT 3180 
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GGTGCACATA AAGATATTGA ACAGCAAGCT 
TTAGATTCAC TTGAGTCATT ATCACGTGAT 
AGAAATATCG GTTCTTATAT AGAATAATCA

GGGTTTTACC ATAAATTGAA GTACATTAAA 
TAACTAAGAT TAGTAGTCAA AATTGTGGCT 


AATTCAATAT TTAAAACGTT TACATCAATT 
CAATTTGTGG TAATTTAGAG TTAAAGATAA 
GAAAATTGCA GTTTTAACTA GTGGTGGAGA 


AGTTGTTCGT ACAGCAATTT ACAATGAAAT 
AGGATTGTTA AATGATGATA TTCATAAACT 

GCGTGGAGGT ACATTCTTGT ATTCAGCAAG

TAAAGTTGCA ATCGAAAACT TACGTAAAAG 
TGACGGTAGT TATCGCGGTG CACAACGCAT 

CGGTATTCCT GGTACGATTG ACAATGATAT

CACAGCATTA AATACGATTA TTGGCTTAGT 
CGCACGAACA TTTATCATTG AAGCAATGGG 


TGGATTATCA GTTGGTGCTG AGACAATTGT 
AATAGCTGAT AAAATTGAAC AAGGTATTAA 
AGCAGAAGGT TGTATGACTG CGCAAGATTG 


TGATAATAGA GTGTCTGTGT TAGGTCACGT 
TAGASTTTTA GCATCACGTT TAGGTGGATA 

AGCTAAGGGT GTTGGAATTA AGAACAATAA
TGATGGTAAA GATCATAAAT TTGATTATAG 
ATAAGATTTC AGGAGGAATT ATAAAATGAG 

AGCTTCAGAA TCAGAAGAAA TGATTGAGAA

ATTAAACTTT TCACATGGTA GTCATGAAGA 
AGTAGCTAAA AGATTAGACA AAATTGtAGC 


TCGTACGCAT AATATGAAAG ACGGTATCAT 
TAGCATGAAT GAAGTTGAAG GAACACCTGA 




TTAGCTATTA AATCAGCGTT TGTTGCACAG 3 3 00 

GAAATTGCTA ATGATCGCTT TGAAAAATTC 3 3 60 

ACTTGAGCAT TTTTATGTTA AATCGATACT 3 420 

ACAATAATTT AATATTTAGA TACTGAATTT 3480 

ACTAATCTTT TTTTAATTAA GTTAAAATAA 354 0 

CAATACATTA GTTTTGATGG AATGACATAT 3 600 

ATCAGTTATA GAAAGGTATG TCGTCATGAA 3 660 

TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3 720 

TGAAGTTTAT GGTGTGTATC ATGGTTACCA 3780 

TGAATTAGGA TCAGTTGGGG ATACGATTCA 3 840 

ATGTCCAGAG TTTAAGGAGC AAGAAGTACG 3 90 0 

AGGGATTGAG GGCCTTGTAG TTATTGGTGG 3 960 

CAGTGAGGAA TGTAAAGAAA TTCAAACTAT 4 020 

CAATGGTACT GATTTTACAA TTGGATTTGA 4080 

CGACAAAATT AGAGATACTG CGTCAAGTCA 414 0 

CCGTGATTGT GGAGATCTAG CATTATGGGC 4 200 

AGTTCCAGAA GTGAAAACAG ATATTAAAGA 4260 

ACGTGGTAAG AAACACTCAA TCGTTCTTGT 4 320 

TCAAAAAGAA TTATCACAAT ACATCAATGT 4380 

TCAACGTGGT GGTAGCCCAA CAGGTGCGGA 444 0 

TGCGGTAGAC TTATTAATGC aAGGTGAAAC 4 500 

AATTGTAGCA ACATCTTTTG ATGAAATTTT 4 560 

TCTATATGAA CTTGCTAACA AGTTATCTAT 462 0 

AAAAACTAAA ATTGTATGTA CAATTGGACC 4 680 

ATTAATCAAT GCTGGTATGA ACGTTGCACG 4 74 0 

GCATAAAGGT AGAATTGATA CAATTCGTAA 4 800 

AATTTTATTA GATACAAAAG GTCCAGAAAT 4860 

TGAACTTGAA CGTGGCAACG AAGTTATTGT 4 920 

AAAGTTCTCA GTAACATATG aAAACTTAAT 4 980 
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TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 

AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 5280 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 534 0 

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 54 00 

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATCAGACAA TGTAACAAAT TAGGTAAACC 54 60 

AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACGTG CTACACGTGC 5520 

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 5580 

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 5640 

ATCAGCTGAA GCAGCCCAAG ATTACAAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5 700 

AACTTCATTA GTGAATGCTA TCGGTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 5760 

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820 

TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTT CAAT 5880 

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5 94 0 

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6 000 

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6 060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120 

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATCGTTAC 6180 

TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 624 0 

AGAAAATGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300 

TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360

TGCTCAAGGT AAAATCTTTG AAGGATATGC AAACGTACTA TAATTTATAA AAAAACGTCT 6420 

TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 64 80 

AATAGGTTAT TTGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 6540

TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600

AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 


TTTTGAGCCA AG T A CG CAAT TTAATATTAT AAGTTGCATA TAAAACAGGA TGGGACATAA 6720 

ATCCCTAAAA AAACAGCAGT AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 6780 

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t CTTCGACTG GCACTGCTCC CTCAGGAGTC TCGCCATTAA TACTACGTAT TAACATGTAA 6 900 

TTTTACTTTT ACATACTTTA AAAAAATAAA ACACTTTGCC CAACTTGCAC ATAAATGTAA 6 96 0 


AATTCAATAA AATGAATTTT CTGTGTTGGG TCCCTTCGTA TAATTTAATA AATACCACTA 7 020 

AACTAAATTA ACGAGGTGCC TTATGTATAA AATTTATAAC ATGACCCAAC TTACACTACC 7080

AATAGAAACC TCTGTTAGAA TTCCTCAAAA TCATATTTCG CGATATGTTA ATGAAATTGT 714 0 


TGAAACGATA CCTGATAGCG AATTCGATGA ATTCAGACAT CATCGTGGCG CAACATCCTA 7 2 00 

TCATCCAAAA ATGATGTTAA AAATCATCTT ATATG CAT AT ACTCAATCTG TATTTTCTGG 72 6 0 

TCGTAGAATA GArAAATTAC TTCATGACAG TATTCGAATG ATGTGGTTAG CTCAAGATCA 7320

AACACCTTCT TATAAAACTA TTAATCGTTT TAGAGTGAAT CCTAATACTG ATGCGTTAAT 73 80 

TGAATCTTTA TTTATTCAGT TCCATAGTCA ATGTTTAAAG CAAAATCTTA TTGATAATAA 74 4 0 

TTCAATTTTT ATTGATGGTA CAAAAGTAGA AGCTAATGCC AATAGATATA CATTTGTGTG 7500

GAAGAAAAGT ATTCAAAATC ACGAATCGAA ATTGAACGAA AATTCAAAAA CATTATATCG 7560 

TGACTTAGTT GAAGAAAAAA TAATACCAGA GATAAAAGAA GATGGAGATA GCGATTTAAC 7620 


AATAGAAGAA ATAGATTTAA TTGGTAGTCA TTTAGATAAA GAAATCGAAG ATTTAAATCA 76 8 0 

TTCTATTGAG AACGAAGATT GTGCTCAAAT TAGAAAACAG ACCCGTAAAA AAATAACTGA 774 0 

GATTAAGAAG TTCAAAAAGA AATTTGATGA TTATTCCGAA AGAAAAAATA AATATGAAGA 7800


ACAAAAATCG ATTCTTAAAG ATAGAAATAG TTTTTCTAAA ACTGATCTGA TCATGATGCA 7860

ACTTTTATGA GAATGAAGGA AGACCATATG AAAAATGGCC AACTTAAGCC AGGATACAAT 7 92 0 

TTACAAATAG CGACAAATTC TCAAAAATGT TTTATCCTAT GACCTATTTC AA 7972

(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3175 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 6 0 

ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 120 


CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GTCAAGTTGA TTTATGCTCA 180

CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 24 0 

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AATTAAATTA TTGTTGTTTT AACCATTGTA CTATCGTCGG AATCATAAAT CCTGTTGGCC 360 

CTTTTGGACC ATTATATGAA GCTTTATTAG TCGTTGCTGG ACCTGCAATA TCGAAATGAA 420 

TGTGAGGTGT TTGACCACTA AAATGTGTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 4 80 

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTGTG TTTAATACTT GCACGTTCGG 54 0 

TTGCAGTAAT CGGTAATTCA AAT AC CATTT CATCGACTTC AGAACTTATT TGTAATATAT 600 

CGTTTAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 6 60 

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720 

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTT CTA 78 0 

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 840

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900 

CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCCATGC 960

CATTCTTCGT TTTAATACTA TAACCACCAG AATCATACGT TATACCTTTA CCAACTAAGG 1020 

CAATTGGTGC TTCATCTTTG TCTTTGCCAT TATATGTGAT GGTTACTAAt CTCGGTTTAT 10 8 0 

GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 1140

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATTAACAATA TCTTCTGcAA 12 00 

ATGTTTGTGG TGtTAATACA TTCGGTGGcA TATTACTAAA GTCTCyTGcC AAATTAATGG 12 6 0 


aTTGGcCAAT ACTGATACCC TCATGAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320 

GGkTTAAATT CGTCTTAAAC GGtGCCTTCT kACTTgrTTT ATAATGATCG aATyCATATG 13 80 

kTGcACGCTC ACTTTGGAAT ACCGCATGCC ATTAATACAT CTGgATAACT GgATCcATAT 14 4 0 


TTTGgAnATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500 

TTGGAAAAGA TGTCCCCaTA TTTTCAACAT ATCTTGATAA tACGTGTCTT TAAGTTTCCT 1560

AAACCAACTG TAATTAAACG ATATGTTTGA TCTTGTACAT CAAATGCAGT TGTATAAATC 1620

TTCCCAACTT TACTACCAAT AATATGTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 16 80 

TCGATATGAT TAAAACTAAT GCGCTCTAAC TGATTTAAAT GTTCTGGTAT AC CAAT AAT C 174 0 

AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AATTCATAAT GTACCTCCTT 1800

AGATTTGATA TGTTACTCAC TTCAAATTGT ACAACAATAA AGCCCTcAGT GACACTGAAG 1860 

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGCTG AAACTTTATT TAAAGCCTAG 192 0 


AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 1980 

ATCACTTTTT TCATGAATGC TG CTTTTTT A CCAGCGATAG GTTTACCAAA TACCATACCT 204 0 
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GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTGTTGCATT 216 0 

GCAATTTGTG CTGTAGTTGG TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 222 0 

TCACCAATAA CAAAAATGTT GTCGTAACCA TTGATTGTTA AATCTTGCTT TGTAACGATA 2280

CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 234 0 

GCCCATACTG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CATCTACTTC AACTACAAAA 24 0 0 


CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 24 6 0 

AAGTAGCTAA cTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2 52 0 

GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 2 58 0 


GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 264 0 

AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTCT 27 00 

TCGATATCAC GTGATAATTC ACGTGCTGTG ATAACATTTT CAATTTGGAA AGCATGATCT 2760

TTCATACCTT CGATGCCGAA TGTTTCACTA AC G AAA C CT A ATG CTACT AC TAAAATATCA 2 82 0 

AAGT CAT AAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 2 880 

TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 2940

TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATGTTGCTTC ATAGTGATAT 3 0 00 

TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 3 06 0 


GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTTT ACGATCTTGA 312 0 

GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 3175 
(2) INFORMATION FOR SEQ ID NO: 314: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 60 

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 12 0 

ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 180 

GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATGCCACAT 24 0 

GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GG ATT ATT CA CCGTGAACCG 3 00 
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TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG 420 

AGACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 4 80 

GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAACGAGAG 54 0 

AAAGATTACA AAGCTTATTT GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 

ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 
(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315:

TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 6 0 

AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 120 

ACGAATGCAA AG CCAAAG AT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180 

GTTACGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 240 

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 3 00 

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 3 60 

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 

GCGATACGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACCAG TAAGAACCAA 4 80 

TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 54 0 

AG ACT AAA CC TAAAAGTAAC GTAATGAT CA TCCAAAACAT CATTAACTTT TGTTTTTCTT 6 00 

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 660 

TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720 

CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780 

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 840 

TTGTATCATG ACTCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 

GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 960 

CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACAGTTTCG 1020



TTGT7TGGCA TATGAATGTC TTTGTAATTA TGGTTGTCTA AGTAATGACG ACCATGTTCT 114 0 

TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 1200 

GGTGGAATTG CTGATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATCT 1260 

CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 1320 

AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGTTTAATAA GAACCAACCA 13 80 

TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT 14 4 0 

GGTAAGAAAC AAACGTTAAA TGCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTT 150 0 

TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 1560

AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 1620 

TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 1680 

TTAGGGATAA ATGCTAATGA GAATAGCATT GGTGACTCAA ATGTAATACG TCCTTTATAT 1740

AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 1800 

ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACCCAA 1860

ACTAAGAAAC TAAGGAACGC GATACCGGCA GTTGCCCATA CCATACTTTG ATGTCCGAAT 1920

AAACGCTTAC GAGCGAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 198 0 

ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAGCCCAAAG CATTGGCATA 2040


CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 2100 

AACGCTACTG TTAAAGGAGG GAAAGCAAGA ATAACAATTA ATGTAGTAAT AAATGTTGTT 216 0 

ACTGTAAACA TTGGCATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 2220 


ACAAAGAAGT TGATACCTGT AGCTAAGGTA CCAAGCCCTG AAATTTGTAT AGCTATTAAG 2280 

TAATAGTTAA CACCCGGACC AGGACTGAAT TCACCTGCTA GTGGCGCATA GTTTGTCCAA 2340 

CCAGCTGCTG GTGAACCACC AATAATAAAT GACAGGTTGA ATAAAATCAT ACCTGCAAAG 2400

AATAGCCAGA AACTTACGTT GTTTAATACT GGGAATGCAA CATCACGTGC TCCAATTTGT 24 6 0 

AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 2520 

ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTGGTTAG ATTCTAAAAA TTTGTTATCA 2580

GGTACTGTTA ATTGCGCACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 264 0 

ACGGCACAGA TTAAATACAT AATACCGATT TTCTTATGGT CTACAGATGT GAACCATTCT 2700 


TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACTG CGATTAAACC AATAACTAAG 2760 

AATGGGGCAC CAATTTGTGC CATTGTAATC ATcCAGTTAC CTTTAACTAG TAATTGATCC 2820

TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTGTTAT 2940

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG TCGTTTATAG TTCGCATTTG 30 0 0 

TAATTTGAGC TTTACGAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 306 0 

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT GCATAGAAGA 3120 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 3240 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTCTTTTTG ACTAACGGCA TTTACTTTAA 3300 

ATGTTTGACG TGAGAATCCT TCACCATTGA AGTTAGAGTT ACGACCTCTG AACGTACCAG 3 3 60 

TTTGAGATGC TTCTAACGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 34 20 

CTAATTGTGG AATCCAGAAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 3480

GACGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 3 540 

AGAACCATTT GTATCCTGCA CTTACTGCAT ATACAACCAT TGGATCTTTC TCACTCTTCG 3 600 

GTGGTTTTTC GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGctAAAGCA GCAACGATTA 3660

TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 3 720 

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACGA 3780

AACAAATAAC AAGCATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 3840

CTGGCCCTTT TGCGTTGAAA ATTTCTATAT TTGAACAACC ACTAAGTAAA ATTAGTGTGC 3 90 0 

CAAATAATAG AAGCAAAGAC TTAAATTTTG ACACTTTTTT GACCTCCTAA TACTACAAAT 3 96 0 

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 4 02 0 

AATTAATAAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTGACGTGGG TTCACACATT 4 080 

TGTTAAAATT ATGTGTACAT TTTGTGACTA ATAGCGTTTT T 4121 
(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:
CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 6 0 

ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 12 0 



TTGGATTACG ATTAGTTGAA AAGTCGGTTA ATCAAGACAA TCCTTCAATG TATCATTTGT 240

TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATTTTAAG CTTTTTTGAA ATTCCCAATG 300 

CGGGTCATAA GCAGCCAGGT ACTGAAACGA TTTATCGATT TTCATTATTA GTACCAAATC 360

AAGCGGCACT TCATTATTTT GAAAAACGTC TTGAGAATAA TGGTATTAAG TCTGAACGTT 420

TGTACTATCT TGGACAAGAA GGTGTTGTCT TTAAAGATGA AGACGACTTA GAAATCATAT 480

TGCTTGTTAA TGATAGTTTT GAAGTACCAC ATCAATGGCA ACATAACGCT TATAGTGAAA 540

TACCTCAAGC ATATCAAATT TTAGGAATAG GGCCAGTCGA ATTAAGAGTT AGAAATGCAG 6 00 

CGCGTACGGT AGAATTTTTG GAAAATGTCT TAGGTTATCG CAAAAGAGAT AATAAATCAT 6 60 


TCGATGTGCT GACATTAGCA CCACAAGGTT TATATTCGGA TTTTGTAGTT ATTGAGCAAC 7 20 

AGGGACAACG TGAAAGACCT GGACGAGGTT ATATCCATCA TATTGCAGTT AATACACCAC 78 0 

AAATGAGTGA CTTAGATGCA ATTTACAAGA AATTACAACA ACAACCACAA AGTAATTCAG 84 0 


GTATAATTGA TCGCTATTTC TTTAAATCAT TATACTATCG CCATAATTCA ATTATGTATG 9 00 

AATTTGCGAC TGAAGCGCCT GGATTTACTA TTGATACACC TGTTGAACAA TTAGGAAGTC 96 0 

AATTGAACTT GCCTGACTTT TTAGAAGCAG AACGTGAACA AATTGAAAGT AAGTTACACG 1020

AAATATAAAG GAGAATGTTT AATGGCCAAA TTAGAAATGA ATAAAAATAC GCCTCTTGAG 10 80 

TTTGGTTTGT ATTCCTTAGG TGATCATTTA TTGAATCCAT TGAAAGGTGA AAAAGTTAGT 114 0 

TATGAGCAAC GTATTAATGA AATTATTGAA GCAAGTAAAT TAGCAGATGA AGCAGGTATT 1200

GATGTTTTTG CAGTTGGTGA AAGTCATCAG GAGCATTTTA CAACACAGGC ACATACGGTT 12 60 

GTGTTAGGTG CAATTGCCCA AGCGACAAAG CATATTAAAG TTTCAAGTTC TTCAACGATT 1320

ATTAGTGCAc AGATCCTGTA AGAGTATTTG AAGACTTCGC GACATTAGAT TTGATTTCTC 1380

ATGGTAGAGC CGAAATTGTA GCTGGCAGAG CATCAAGAAC AGGTATTTTT GACTTGTTTG 144 0 

GCTATGATTT AAAAGACTAT GATGAATTGT TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 1500 


TAAATAAAAC TGAGCGTATT ACTTGGTCTG GAAAATATCG TCCAGAACTT AGAAATATGA 1560 

AAATATTCCC AAGACCAATC GATAATATAT TGCCAATATG GCGTGCTGTT GGTGGTCCAC 1620 

CTGCAAGTGC TATTAAAGCG GGAAAACAAG GTGTGCCAAT GATGATTACA ACCCTTGGTG 1680


GCCCAGCAAT GAACTTTAAA GGTTCTATAG ATGCTTATCG TCAAGCGGCA ACTGAAGCAG 1740 

GTTTCGATgc TTCGCCT7AAG TCTTTACCAG TAAGTACAGC GAGTCTGTTT TATACAGCTG 1800 

AAACAACTCA GGATGCTATG AGAGAATTTT ATCCACATTT GAATACAGGG ATGTCATTTA 1860

TTCGTGGTGT TGGTTATCCG AAACAGCAAT TTGCTAATTC GTCAGATTAT CGAGAAGCGC 1920 

GTCATCAACG TTTTATGGCA CAGCTTGATT TTGGCGGTGT GCCATTTGAA AATGTTATGA 2040

AGAATATTGA GTTAATTGGC AACGACATTA TACCGGCGAT TAAAAAGCAT TTATCAAAAT 2100 

AGGAGGGGCG TCATCATGAA TA7TGTATTA TTGTCAGGTT CCACAGTAGG TTCTAAAACG 2160 

AGAATTGCTA TGGATGATTT AAAAAATGAA CTAGAAGTCA TCAATGAGGG ACATCAAATA 2220

GAGTTGATGG ATTTACGAGA ACTTGAATTA GAATTTAGCG TTGGAAAGAA TTATCTAGAT 22 80 

ACTACAGGAG ATGTATATAA ATTAACGACG TCGTTAATGC AGGCTGATGT GATTTTTATT 2 34 0 

GGTTTTCCAA TTTTTCAAGC TTCCATCCCT GGTGCTTTGA AAAATGTGTT TGATCTACTT 24 00 

CCAGTCAATG CGTTTCGTGA CAAGGTAATA GGACTTGTAG CGACAGCAGG TTCTAGTAAA 24 6 0 

CATTATTTAA TTCCTGAAAT GCATTTAAAA CCAATATTGA GTTACATGAA AGCACATACG 2 52 0 

ATGCAAACGT ATGTATTTAT TGAAGAGAAA GATTTTTCAA ATCAACAAAT TGTCAATGAT 2 580 

GATGTTGTAT TTCGGTTAAA AGCGTTGGCA CAATCCACAA TGCGAACTGC CAAAGTACAA 264 0 

CAACAAGTGT TTGAAGAAGA AAACAACCAA TACGACTTTT AAAGTATAAA AATAAGACGC 2700 

TCGGCACACT AAATTTGTAA GTGTTTGAGC GTCTTTTCAT ATTAACTATA TAGCCAATGA 27 60 

ACGACGATAA AGGCAAGTGA TGACAAGCAT ATTGAGGTAA TAATGATTGT CATAAGCGGT 2 82 0 

TTAAGTGCGC GATTTTTAAG ATCTTTAAAT GCAACATTTA ACCCTAAAGC AACCATGGCC 2880 

ATTAATAAGC AAATTGTT G A TACAGTATTT AAAATATTTA GCAATGCTGA CGGAATAGTT 2 94 0 

ACATATGTAT TCACTAAGGC CATAATGACA AATCCAATTA AAAAGTATGG AATGCTTATT 3 00 0 

CGACCCTTGC TAGATGATTC TGATGAACGG AAACG CATAA TTAAAATAAG TACGATGGTT 3 060 

AATGGAATCA GTAAGAATAC TCTACCAAGT TTACCAAGAA GTGCAATTTT AAGTGCATCA 312 0 

CTACCACCAA AGCCACCAGC TAAGACAACG TGTGCAATTT CATGAAGACT AACACCAGAC 3180 

CAAGCGCCAT AAACATTTGT CGTCATTGAA AAGATAGCGT AGATAGCTGT ATATATAAGT 3240 

GAAAATATCG TACCAATCAA TGCGATGATA CCGATACTAA TAGCTGTATC CTTTTCACGT 3 300 

GATTTGAATA TTGGAGCGAC TGCGGCAATA GCAGCAGCAC CACAAACGCC TGTGCCGACA 3 36 0 

CCTAGTAATA ATGCGATGTT TTTGTCACCA TGCAACAGTT TGTTGACAAA GAGCATCATT 342 0 

ACAATACTGA AAATAACGAC ACCTACATCG ATGG CTAAT A GTTTACTACC TTGACCGATA 34 80 

ATATCGAATA TATTGAGTTT AAGTCCATAT AGGATGATTG CAAATCTTAA TAAATATTTA 3 54 0 

GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAAAGTG ACGATATAGA 3 600 

ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTAT CTAGGATTGG CAATTTAGCT 36 60 

GCTAAAAAGC TAAATAATGC GACTATAAAT GTTAGTGATA GCCCAATCAT AAAATGCTTA 3720

ATTTTTAAAT ATAAATTTGG AATGAATAAT AAAGTAGTGA TTAAATTAAG TTGTGTGATA 3840

GGAAACTTGG ACATCAATCA AAGTAATAGG CACTACAACG CTTATTGGCG GGGCCCCAAC 3900

AAAGAAGCTG ACGAAAAGTC agCTTGcAAT AATGTGCAAG TTGGGGATGG GCCCCAACAT 3960

AGAGAAATTG GGTCCGTAAT TTCTACAGAC AATGCAAGTT GGCGGGGCCC CAACATAGAG 4020

AATTTCGAAA AGAAATTCTA CAAGCAATGC AAGTTGGGGA AGGACAACAA ATTTAAGATA 4080

CAATGCGTAA CATTAATATG TTATTATAAT GATAATTTAC AGAATTATAT GAAAAATGAA 4140

TGAGGATGTG ATGGTATGTT TGGAATGAAA GTGAATGAAC AAATAACATT AAAAATTTTA 4200

GAAGCTCATG ACACAGAAGC GCTTTTCAAT TTAGTCAATC GTTCAAGAAA TTCACTTAGG 4260

GAATGGTTAC CTTGGGTAGA TGCAACTGAG CAACCATCAG ATACGCGTGC ATTTATTAAA 4320

AGAGGACTTT TGCAATTTGC TGATGGTAAT GGATTTCAGT GTGGCATTTG GTATGAAGGA 4380

ACGCTAGTTG GTGTCATCGG TTTACATGAA ATTAATCACA TGCACAGAAA AACTTCATTA 4440

GGGTACTATT TAGATAAAGA ATTTGAGGGT CATGGGATTA TGACACAAGC AGTTGAGGCA 4500

TTGATAAAGT ATTGTTTCGA AGAGCTTGAC TTAAACCGAA TTGAGATTAG TGCCGCAGTT 4560

AATAATGAAA AAAGCCGGGC TATTCCTGAA AGGCTGGGAT TTACTAGAGA AGGTATGTTA 4620

CGTGACAATG AATTACTAAA TGGTATTTAT TCATCGAGTT ACATCTATAG TTTATTAAAA 4680

TCAGAATACG ACCAAAAATG ACAAATTAGA CTTACAAAAG AGTGATGACA TTTAAAATGG 4740

CAGCGCTCTT TTATTTAATT TTTGAAAATA AAAGGTTGTT GACAGTATTA TTTTATAACA 4800

ATATAATGAT TTTGATAATT ATTATCAACT AGATGATGTT TATGGGAGGA TGCTTTAAAA 4860

CAGCCGTTTT AAGTGTAATG TATTATTTTA GCGTGTAGGG AATGCGAAAA TAATATTTAT 4920

AAGAACACAT CTATGGGGAT AATAGAATTT CTATAATGAG GTGTCAAAAT GAAAAAGTTA 4980

ACAACGCTAT TATTAGCATC AACGTTATTA ATTGCTGCAT GTGGGAACGA CGATAGTAAG 5040

AAGGATGATT CAAAGACATC GAAAAAAGAT GATGGTGTTA AAGCAGAATT AAAACAAGCA 5100

ACAAAAGCAT ATGATAAATA TACTGATGAA CAGTTAAATG AATTTTTAAA AGGTACAGAA 5160

AAATTTGTTA AAGCGATTGA AAATAATGAT ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 5220

GTTCGTATGT ATTATGAACG CTCTGAACCA GTTGCAGAAG CATTTGGAGA TTTAGATCCT 5280

AAAATTGATG CACGTCTTGC AGATATGAAA GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 5340

TATCATAAGA TTGAAAAAGC ATTATACGAA GATAAGAAAA TTGATGATGT GACTAAAAAA 5400

GATGCACAAC AATTATTGAA AGATGCAAAA GAATTGCATG CCAAAGCTGA TACATTAGAT 5460

ATCACACCAA AATTAATGTT ACAAGGTTCT GTTGACCTAT TAAATGAAGT TGCAACTTCT 5520

GTTGAAGGCG CACAAAAAAT TTATGACTTA TTTAAACCTA TTTTAGAGAA AAAAGATAAA 5640

AAATTAAGTG ATGATATCCA AATGAACTTC GATAAAGTGA ATCAATTATT GGATAAATAT 5700 

AAAGATAACA ACGGCGGTTA TGAGTCATTT GAAAAAGTAT CGAAGAAAGA CCGTAAAGCA 5760

TTTGCGGATG CTGTTAATGC ATTAGGAGAG CCACTAAGTA AAATGGCTGT GATTACTGAA 5820

TGACAAATTA TGAACAAGTT AACGATAGTA CGCAATTTTC AAGACGTACA TTTTTGAAAA 5880

TGTTAGGTAT TGGCGGTGCC GGTGTTGCAA TTGGCGCAAG TGGTGTTGGT AGCATGTGGT 5940

CTTTCAAATC AATGTTCAAT ACACCAGAAG ATCCGGAAAA AGATGCGTAT GAATTTTATG 600 0 

GTAAAGTGCA ACCAGGCATT ACCACACCCA CGCAAAAAAC ATGCAATTTC GTTGCGTTAG 6060 


ATTTGAAGTC AAAAGATAGA GATGCAATTA AGGCAATGTT TAAAAAGTGG ACGGTTATGG 6120 

CTGATCGTAT GATGGATGGT GATACAGTTG GCAAGCCGAG TAACAATCCT TTAATGCCAC 6180 

CAGTAGATAC CGGTGAATCG ATAGGATTAG GTGCAAGCAA GTTAACGATT ACCTTTGGGA 6240


TTAGTAAGTC TTTGATGAAG AAAATTGGGT TATCTAGTAA AATTCCCGAT GCCTTTAAAG 63 00 

ATTTACCGCA TTTTCCGAAT GATCAGTTAA TAGACGATTA CAGCGATGGT GATATTATGA 63 6 0 

25 TTCAAGCATG CTCAAATGAT TCGCAAGTAT CCTTTCATGC GGTTCATAAT TTAGTTCGTC 642 0 

CATTTCGAGA TATTGTTAAG GTACGTTGGG CGCAATCTGG TTTTATCTCT GCTAAAGGTA 64 80 

AGGAAACACC TAGAAATTTA ATGGCATTTA AAGATGGAAC AATTAATCCT AGGAAGAATA 6540

ATCAACTTAA AGATTATGTG TTTATTGATG ACGGATGGGC GAAACATGGA ACTTATTGCG 6600

TTGTCAGACG TATTCAAATA CACATTGAAA CGTGGGATCG TACTGCGCTG GAAGAACAAG 6 6 60 

AGO CT AC ATT TGGTCGGAAA CGACATAGTG GTGCACCGTT AACAGGTGGG AAAGAGTTTG 672 0 

35 ATGAAATTGA CTTAAAAGCG AAAGATAGTC ATGGCGAGTA TATTATTGAT AAAGATGCCC 6 78 0 

ATACGAGGCT AGCGAAAGAA GCAAATACGT CAATTTTACG TAGAGCCTTT AATTATGTGG 684 0 

ATGGTACGGA TGACCGCACA GGTAACTTCG AAACAGGCTT ACTTTTTATT GCTTTTCAAA 6 900 


AAGCGACAAA ACAATTTATC GATATACAAA ATAATTTAGG TAGTAATGAT AAATTAAATG 6 960 

AATATATTAC ACATAGAGGT TCTGCTTCAT TTTTAGTATT ACCAGGTGTT AGTAAGGGAG 7020 

GATACCTTGG TGAAACATTA TTTGACTAAA TTTGTAGCAA TGCTAATAAC TGCTGCTATG 7080 


GTGTGTAGCT TTGGGTTACT GAAAAGTCAG GCAGCAGAAC AACAAAGTAT TAGTGATGTA 714 0 

TATAGTGTGA TAACGGATGC GAAATCTGCA CTTTCTAATA ATTCGATATC GAATGACAAT 7200 

AAGCAGAAAG CAATTGAGCA AGTGGTAAGT GCAGTTAAGA AATTATCGCT TGAAGATAAT 7260

AGTGAAAGTA ATGCTGTCAA ATCAGATGTG AGAAAGCTTG AAGATGCAAA AGCGAATGAT 7320

GCTAGTAAAG ATGCGGGTTC TAAAATTAAA CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 7440

GCTGCGATGA CAAAAGCGAT TAAAGATAAA AATAAAGCGG AATTAGAATC TTTGAACAAT 7500 

5 AGTTTGAATC AGATTTGGAC AAGTAATGAA ACAGTGATTC GCAATTATGA CGCAAATCAA 75 6 0 

TATGGACAAA TTGAAGTCGC ATTATTACAA CTTAGAATTG CAATTCATAA GTCACCATTA 76 20 

GATACGGCAA AAGTGTCACA TGCTTGGACA ACTTTTAAAT CAAATATTGA TCATGTCGAT 7680

10 AAAAAAAGTA ATACGTCTGC AAATGATCAA TACCATGTAT CACAATTAAA TGATGCGTTA 7 74 0 

GAGAAGGCGA TTAAAGCTAT CGACGACAAT CAATTGTCGG ATGCTGcaTg TGCGCTTACA 7 8 00 

CATTTTATAG AAACTTGGCC GTATGTTGAA GGTCAAATTC AAACTAAAGA CGGTGCTTTG 7860

TATACGAAAA TTGAAGATAA AATACCATAT TATCAAAGTG TATTAGACGA ACATAATAAA 7920

GCACATGTGA AAGATGGTTT AGTAGATTTA AATAACCAAA TTAAAGAGGT TGTTGGCCAT 7980 

AGTTATAGCT TCGTCGATGT GATGATTATC TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 8040


ATTGTAATGA CATTGACTAC CATGACGCGT AATGTAAAAG ATAAGAAAGG GACTGCAAGT 8100 

GTGATTGGTG GTGCAATTGC CGGACTTGTA CTGAGTATTA TCTTAGCAAT TACGTTTGTA 8160 

GAAACTTTAG GGAATAGTGG CATTCTTCGT GAAAGTATGG AAGCGGGATT AGGTATCGTT 8220

GCGGTCATAT TAATGTTTAT CGTTGGTGTT TGGATGCACA AACGTTCAAA TGCAAAACGT 82 80 

TGGAATGACA TGATTAAAAA TATGTATGCT AATGCGATTA GTAATGGTAA TTTGGTATTG 8 34 0 

30 TTAGCGACGA TTGGTTTAAT ATCTGTGTTG CGTGAAGGTG TCGAGGTTAT CATTTTCTAT 84 00 

ATGGGGATGA TAGGTGAGCT AGCGACCAAA GATTTTATTA TTGGTATTGC TTTAGCTATC 84 60 

GTTATTTTAA TCATCTTTGC ATTATTATTT AGATTTATAG TTAAATTAAT ACCTATTTTC 8 52 0 

35 TATATATTTA GAGTGTTGTC GATCTTTATT TTTATTATGG GATTCAAAAT GCTTGGCGTA 8580 

AG T ATT CAAA AGTTACAATT ATTAGGTGCG ATGCCAAGAC ATGTTATTGA AGGATTCCCA 8640 

ACGATTAACT GGTTGGGCTT TTATCCAAGT TATGAACCAT TGATAGCACA AGGTGCTTAT 8700 


ATTATGGTAG TTGCTATCTT AATCTTTAAA TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 87 60 

GTTTTTTTTG TTGCTATATT GGAAATATTC GGTATTGCAG TATAACGATA AT C AC AG CAT 88 20 

TGATTCGTAT AAGGTTAATG TGTTGGCGGT TTGCCTCGGC ATGTGAACTT AACGATGAAC 88 80 


ATACTGAACT CAAAGAGCAA TATGAGTGGC AATGTGAGTA ATATATTTAA TGTTAAATCG 8 94 0 

GGTGGTGCAA TGATACTTGC TAATACAAAG CAAGCGAAAT AAATATATTT ACGrTAATGT 9000 

50 TTCAATGATG TGGTATCTAT AAGACCGAAT TTTGCAAGAC CCATAAATAA TATTGGTAAT 906 0 

TGAAATAGAA GACCAAATGT GAATAACCAA CGTATGAGTT CAATCAAATA TGCTTTAAAG 9120

GCAAAGCCAA CATAAAATGC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTGAAAAAA 9240

CTATATTTAT AAATAAATTG ACGTTCATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 93 0 0 

AATTGATAAA 9310



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3458 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 60 

CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATT AATTG AAGCTGAGTT 12 0 


ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 180 

AAG TATTTT A AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 24 0 

25 TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 300 

TAAGAAAGGA GAAGAAATTT TAAAGAAAAC AGAGTACTTA ACCTCTTTTA TACAAAGTTA 36 0 

TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 42 0 

30 CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA 48 0 

GTTTGATTTT AATCATGGAT AT AATTT C G A ATTGAATATG TTGATAGATC ACTATAAAAG 54 0 

AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 6 00 

GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 660

AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 72 0 

GTACG AG CG A TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 78 0 

40 

AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TCATAAACTA 84 0 

TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 

TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 96 0 

45 

TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 102 0 

TATTGATATT CGAAGTGGAG AAATT CATTT GGTATTAGAT GATGAT AG CC TTGATATATT 108 0 

AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 1140

GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 1200

AACGTACAGA GCTTGTTAAG TGTTTAGATT ATTTACGAGA GAGCGACACA TTAGTTGTCT 1320

ATCAACTTGA TCGGTTAGGT AGAACGACAA AACTATTaAT TGAATTaTCA CAATGATTCG 13 80 

5 ATGATAACGG AATTGACTTA CAAATTAGTA ACATGAACAT TTCAACGAAA GACACAATGG 14 4 0 

GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAGAAGTTAA TTTACTATGT 1500 

GAGTGTATAA AATAGACTTA GCAGCAACAA GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 156 0 


CTTTACCAGA GAATAAAAAA TGAGAAATTA AATTTTTATA TGATGAACAA ACGATAACAG 162 0 

GGG AAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTATCGA GTTGATTAAG 1680 

AAATGAAAAA ACTTATACTA TGAATTACTG TTTAAAAGTG TGCATGTTAT AATATTTATT 174 0 

15 

GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTGATGCCT ATGGTTATAG 1800 

TTGTTACTCC TAGnAAAGGA CTAGCATGTC TGATTTTGAA ATG CTTATGG TTGTATTAAC 1860 

AATCATTGGT TTAGTATTGA TTAGTACTCA AGACCATAAA AAATAaCCTT CTATTCGCTT 1920

TG AC CGG CAT TTTTGAAGGC TATTTTTAAA TAAAATATAA GGTCACCGTC TTTTTAACGG 198 0 

GCTCATTAGG GTAACATGTT TCCGAGTGTT GCCCTTTTTG TGTTTCAAGA GTTAATGATT 2 04 0 

25 TTCATCTTTT GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTTCTT ATGAATTGTA 2100 

GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT ACAG C AG AAA ATACAATTAA 216 0 

AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 22 2 0 


AATTCACTTA GTTATTTTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 22 8 0 

ACAG TTg GAT TACGCATCAA CTGCATaAGg CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 2 34 0 

AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TGTTGTATTA 24 0 0 

35 

TCATTTAATG AATCTTGTGC TTTTTCTGTT TTTGTTTCAA CAGATGTTGT CACTTTATCA 24 6 0 

TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTT TATTCTTGTC AGCTGTTTTT 2 520 

ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATTGAACGAT ACCCATTAAA 2580 

40 

ATGATTGCAT GAATAACAAC GAATAAAATA ATTGTCATTA CTGTATAAAC TCTTATGATA 2640 

TTTTCTGCAA CACTTTGAGA GAGTGATTGC GTATTAAATG AAATCAAGTA ACCAACAGGT 2700 

45 GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 2760 

GTTTTTTGAC CATTTTTCAT TGCTTTAAAT CCATTAGCTA AGTGTGTTTT TACTTTATCG 282 0 

TT AG TTG ATT CaACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA 28 8 0 

50 ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATGC TTTGAGTTGT AATAGCAATA 294 0 

CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTG AATTCATAAT CATTTCTGAT 3 00 0 












ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 3120 

AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATT TGTAATTCCG 3180 

ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 3 24 0 

ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 3300 

TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CAACTGCAAT TACAGCAAAT 3360 

GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 342 0 

TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 34 5 8 
(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318:

GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT TATGAATATG TTCCAAAATC 6 0 

ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 12 0 

TATTTGCTAA ATTGTTACAT ACGATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 180

ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 24 0 

ATCATTTACA ATGATTGCGT ATAGTGATGT GTCTTTTTTT GTATTATTTA TCTTCGGAcG 3 00 

ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 3 60 

CAACTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATTTGTTGA TGCGTGAAAT 4 20 

ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 480 

TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 54 0 

CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT TTTTTATTTT 600 

TGTAATCTTG CTTAGAAAAT AATTTAACTT CCTCTTCTGT TTyCTTACGT CCAGGTAATG 660 

GAATATCAAA TTyCAAAATT AAGAATCTGA ATAAG 6 95 
(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:

AATATTGaTa TTATATATAG TCGAATCaAT GATGACCCaC ATCGACCtAA TGCAAATAAT 6 0 

AAGACAATCA AGCAGTTAAA TGACTTGTAT TGCTCGCGGA ATTTAAGATA TAATGAAATC 120

GCATTCAAGA AATTTGATAG TCATTTGCTA TCAATTTCAG AAATATATTA TGAATTGCTA 180

AATTTAATAA AAGCGAGTGA TCAGTATTAG AGAGAATAGA GCGTTAAGAC TCTATCG C CG 24 0 

10 

AAGtGCAAGT AATTTATTAC GAAACTCTCA GGcAAAAGGr TAATACTGTA ACGCGTTCCT 3 00 

GAATTGGTGA TTTATAAACA GGGTAGCGAT TGCTATCCTG TTTTTATAAT TTT AAGGGGG 360 

TATTTCAATG TCAAGTGATT TAAAACAAAC ACCTTTATAT CAAAATTATG TTGATAGAGG 420 

15 

TGCAAAAATT GTGGAATTCG GAGGATGGGC GATGCCTGTT CAATTTTCAA GTATTAAAGA 480

GGAGCATAAT GCTGTTCGAT ACGAAATTGG CCTGTTTGAT GTTAGTCATA TGGGTGAAAT 54 0 

20 TGAAGTAACA GGTAAAGATG CTAGT CAGTT TGTGCAATAT TTATTATCAA ATGATACTGA 6 00 

TAATTTAACT ACTTCAAAAG CATTATATAC TGCTTTATGT AATGAAGAAG GCGGTATTAT 6 60 

TGATGATTTA GTAATATATA AATTAGCTGA CGACAATTAT TTATTAGTTG TTAATGCTGC 720 

TAATACTGAA AAAGATTTTA ATTGGATTTT AAAACACAAA GAGAAATTTG ATGTTGAAGT 780

ACAAAATGTA TCAAACCAAT ATGGTCAATT AGCAATACAA GGACCAAAAG CtAGAGATTT 840

AATTAATCAA TTAGTTGATG AAGAJcGTAAC TGAAA 875 


(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

TCTTTTTACG AAGGCTACCG TTTAGGATCT GATAACTTAA CTGTAGGAGA AATTGTATTT 60 

AATACAGCGA TGACAGGTTA TCAAGAAACT ATTTCAGATC CATCATATAC AGGTCAGATC 12 0 

ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA ATAGAGACGA TTTTGAATCA 180 

TTAGTACCTA CATTAAACGG TATTGTAGTG AAAGAAGCGA GTGCGCATCC AAGTAATTTT 24 0 

AGACAGCAAA AGACACTTCA TGACGTTTTA GAATTGCATC AAATTCCAGG G ATTG CAGGT 3 00 

GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 3 60 

ACTGATCGAA AAGAAGATAT TGATCAACTT GTCAAACATT TACAACAAGT AGAATTACCT 420

AGTGTCGTAC TTGTAGACTT TGGTAAGAAG CAAAATATTG TTCGAGAATT AAACGTCAGA 540

GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 600 

5 CCAGATGGCG TTATGCTATC AAACGGACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 660 

CCAATGATTC AAGGAATTTT AGGGAAAATT CCGTTCTTTG GTATCTGTCT AGGACATCAA 720 

CTTTTTGCAT TATCTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCG 780 

10 

AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 84 0 

TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 90 0 

AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 960 


CATCCTGAAG CAAATCCAGG AC CGTCAGAT TCAAACTATC TATTTGATGA TTTTGTAGCA 102 0 

ATGATGACTA ATTTTAAGGA AAAGGAGCGT CAT AT CAATG CCTAAACGTA ATGATATCAA 108 0 

20 AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 114 0 

TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 1200 

TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 126 0 

25 GTTAACTCAT GATTTTATAG CGCGAATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 1320 

AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 138 0 

TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 144 0 

30 CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTACCAG AGAGTGACAT 150 0 

TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 1560 

TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 162 0 

35 

ACATGAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 1680 

ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CGATAATGCC 174 0 

ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TTCATACAGG CGATTCAATT 1800 

GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 1860

TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGTA ATGTTCAATT AG C ATT AG AT 192 0 

CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACG TTCATCAGCG 198 0 

TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 2040 

CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 

ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 2160 

GAACGAGAGC TTGGCACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 2220

TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 2340

GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 24 0 0 

5 

ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 24 6 0 

AGCATCAATT AAAAG AG CAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATG 2520 

GATTTAGTGA TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 25 80 

10 

TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 2 64 0 

TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 2700 

CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG 2760 

15 

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAG CAGGG TACGAAGCGA 2820 

TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 28 80 

20 ACTTTGAACC TTTAACTGAA GAAGATGTGA TGAATATCAT TAATTTAGAA AAACCTAAAG 2 94 0 

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AG CAG A CAAA TTGGCTAAAC 300 0 

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTGCTGAA GATAGAAAAG 3 060 

25 AATTTGAAGC ACTATTAAGA AAAATTAACG TGCCACAGCC ACAAGGGAAA ACAGCTACAT 3120 

CACCTGAGGA AGCATTAGCG AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 3180 

CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 3 24 0 

30 

ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 3 3 00 

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGT CATT ATTCCAGGAA 3 3 60 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 342 0 

35 

CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTTGAGGA CT AT ACT AT A AAATTAG CTA 3 4 80 

AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAATTCGT TATAGCTCAC GATGGTGTGT 3 54 0 

ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 3600 

40 

ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA ACAGATATGG 36 60 

GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 37 2 0 

45 GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATGAAG TCAACAGGTG 37 80 

AAGTGATGGG GAAAGATACT ACATTAGAAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 384 0 

GCGTTGAAGT TAAAGATCAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 3900 

50 AAGTTGTTAA ATTGG CACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 3 960 

GAAcAGCTAA TAAATTAGCT GAGTATGACA TACCTGCAGA AGTAGTAGGC AAAATTGGTG 4020

TGACTAAAGG TAAAGAAGTA GAAAGGGATG GCTTCCAAAT TAGACGTACT ACAGTTGAAA 4140

ATGGTATTCC ATGTTTGACA TCTTTAGATA CAGCTAATGC CTTAACGAAT G T AATTG AAA 4 2 00 

5 

GTATGACATT TACAATGCGT CAAATGTAAA TCAATCAAAC TGTATCGGTG GGGCTGTAAT 4260 

TAACCATTTA CTTAAAGAAG TTTATATTAC AGCCTCATTA TTTTAATGAA TTTCTTAATA 4 3 20 

TAAAGGGAGA CaTATATGAT GAAAGATTTA CCAATTATTG CATTAGATTT TGAATCAAAA 4 3 80 

10 

GAAAAAGTAA ATCAATTTTT AGATTTATTT GATGAATCAT TATTCGTAAA AGTAGGTATG 4440 

GAACTTTTTT ATCAAGAAGG TCCTCAATTA ATTAATGAGA TAAAAGAAAG AGGCCATGAT 4 50 0 

GTATTTTTAG ATTTAAAACT GCATGATATT CCTAATACAG TTGGTAAGGC GATGGAAGGA 4560

CT AG CTAAAT TGAATGTTGA TCTGGTAAAT GTTCATGCTG CTGGTGGCGT AAAAATGATG 4 62 0 

TCTGAGGCCA TTAAAGGATT AAGAAAACAT AATCAAGATA CAAAAATTAT TGCAGTAACA 4 680 

20 CAGCTTACGT CAACAACAGA AGACATGTTA CGACACGAAC AAAATATACA AACATCGATT 4 740 

GAAGAGGCCG TTTTAAATTA TGCCAAGTTA GCAAATGCAG CTGGTTTAGA TGGCGTTGTT 4 8 00 

TGTTCACCTC TTGAAAGTCG TATGTTGACT GAAAAGTTAG GTACATCATT TTTAAAAGTA 4 8 60 

25 ACACCAGGTA TTAGACCTAA AGGTGCATCT CAAAATGACC AACACCGTAT TACGACACCG 4 920 

GAAGAAGCAA GACAGCTTGG TTCGACGCAT ATTGTAGTCG GTAGACCGAT TACACAAAGT 4 980 

GACAAT C CAG TCGAAAGTTA TCATAAAATT AAAGAAAGTT GGTTAGTATA ATGGCTAAAG 504 0 

30 

AAATTGCAAA ATCATTATTA GATATTGAAG CTGTAACATT ATCACCAAAT GATTTATATA 5100 

CATGGAGTTC AGGTATTAAA TCACCGATTT ACTGTGATAA CCGTGTTACG TTAGGTTATC 5160 

CTTTAGTTCG AGGCGCAATC CGCGATGGTT TAATTAACTT AATTAAAGAA CACTTTCCTG 5220 

35 

AAGTAGAAGT TATTTCTGGT ACTGCAACAG CTGnTATTCC ACATGCAGCT TTTATTGCTG 5280 

AAAAATTAAA ATTACCAATG AATTATGTTC GTT CATC AAA TAAG AG T CAT GGTAAGCAAA 534 0 

ATCAAATCGA AGGTGCTAAA AGTGAAGGTA AAAAAGTAGT TGTGATAGAA GATTTAATTT 54 0 0 

40 

CGACAGGGGG ATCTTCAGTC ACAGCAGTTG AAGCCTTAAA ACTAGCAGGT GCAGAAGTAT 54 60 

TAGGTGTTGT AGCTATCTTT ACTTACGGTT TGAAAAAAGC AGATGATACA TTTAGCAATA 5520 

45 TTCAACTACC TTTTTACACT TTAAGTGATT ACAATGAATT AATTG AAGTA GCTGAmAmTG 5580 

AAGGTAAAAT TTCTAGTGAA GATATCCAAA CATTAGTTGA ATGGAGAGAC AACTTAGCAT 5640 

AATATAGACA CTAGAAGGAG GAATTCAACA AATGAATGAC AAAACATCTA AT G ATTT AT A 570 0 

50 TGGGAAGATA AAACATTGTA ACGAATTTAT CAATCATTCA AATGATTCCA ATCTATCTAG 576 0 

TAGTCACGAT GTCGACGAAA GTTCAACGAA GCAAAAACAT ATAAAAAATA AAACAACTAT 5820 

TTAAAAACAA AAAAGCT 5897
(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 6 0 

15 

TGTAATTTAC TAAATATATG ACTGACATGT GTTTTAATTG TTTTTTCAGA TACAAATAAA 120 

GTTTCTGCAA TCTCTTTATT TGTTTTACCT TTAACCATTT CACGTAACAC TTCAATTTCT 180 

20 CTCTTTGACA ACTTATTCGT GTAGTGTGGT TTTTGGCTAA CTGTTTCGAA TACATCTTGT 24 0 

GCCTTAGGAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGTTTC AATTAATTGC 3 00 

TGAGGCTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC TGAAATTACA 3 60 

25 TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA TAT C CGGAT A ATGTGCCTTA 4 20 

ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 4 80 

TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 54 0 


AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAATATAA 6 00 

JcGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAtCAAG CTATTTTATA 66 0 

GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 72 0 

35 

CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT GTGATGAAGC TATCTGAACA 780 

TTAT CGATCT CAAATCCTTG ACCATAATCA ATCACGTCAA TATATAGTAT ATCGTT CATT 84 0 

TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACATT ATTAATACAC 900 

40 

TCTTGTAATG CTCTGTATAT GTTTTCTTCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 9 60 

ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAGC ATGAATTAAA 1020 

45 CCTTGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 1080 

TTTTGACTCG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTTTGACATG 114 0 

CCATACGCAG CATGTGCtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAATCATGT 12 00 

50 AAATCTCTAG CTAAACGATT ACGTTCATTA ATTTTTGCCG CTTCTTTTTC ACGGTCTGTT 1260 

AAATAAATAC GTTTGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 13 20 
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10 



TGACCCGATT TAAGTGGTAC CGTTGCATGA TGTGTAATAT TGTCATTTTG GCTAGGAAAT 144 0 

GCTTTAGAGG CTAAGTTAAT ACGAGAACAA TTGACGATAT TCGACGCTTT CATTAGCCTA 1500 

CGTTGATTAA ATGCTTTCAC ACACCAACAA GACCCATCTT TAATATAGTG ACAGTGGTCT 1560 

GCTGTCAAAG ATTGTGGTAG AGCCACATGT GATACAAGTT CATGTTCACC TACGCTATTG 1620 

ATGAAAAATA TCCAGCCTGT CGTGAAATTA CTGCCCTCAA TTAAATATTT TAACGCACCT 1680

TGGGTCATGC TATACATTTC rGTTTCTTCG TTTAAAAATT CGGCAATCTC TTTTAATAAA 1740 

GCTAGTCGCG TCCTTTGTTC CATCAAATCG CTCCAATTCA TTTTTACGTG TATTAACTAT 1800 

TATACATTGA GTTATTATAT TTTTAAATCT TAGACGTAAA CATGATAAAA TGGCCTTGAT 1860 

TACTCAATAG TTATATTTCG GAGAACTGAT TTGTGATATG ATATTAAAGA CTATAGGAGG 1920 

ATTTTATGAA ATTTAAAATA CCAGAAAACT TTAATGACTT AAGTTTACGA GATATTTTCC 1980 

AACAACTTAA GGTACCTAAA AAAGATTTAC ATCATTTAAA TATGTCTAAA GATATTACTA 2 04 0 

TTAATGATAA ACCTGCGCGA TTAATGGATA AAG TG CAT A C TGGCGACGAT GTATTTGTTC 2100 

CAACCATCGA TGAAAAAAGT AATTATGTTC CAAGTTATCG TTATGCACAA ATTAAATACG 2160

AAGACGATGA TATGGCAATC GTAATGAAAC CTAAAGGTGT TAAGACTCAC CCTAATGATT 2220 

TAAAAGAAAG CAATACTTTA ATGAATCATG TGATTTACAC TATTGATAGT GACTATGTCG 2280

AACCAATTCA TCGACTGGAC CAGGAAACAG TAGGATTATT AATTGTTGCT AAAAATCCTT 2340 

TAATGAAAAA AATTCTTGAT CGCATGTTAG AAGACAATGA TATTACGCGG ATATACAAAG 24 00 

CAAATGTTAA GGCACTTTTA CCTTTAAAAC CACAAACGAT TGATATGCCA ATTGGTAAAG 2460 

ATAAATTCCA TTCGAATAAA CGACGTGTGT CTCCTACTGG ACAGCGTGCA ATTACACACA 2 520 

TTTTAACTTC AAAAATGATA AAAGAAGCTG TGTGCCmACT TGAAATCAAG TTGGATACTG 2 580 

GACGTACTCA TCAAATwCGT GTG CATTT AG CTGAAATTGG TCACCCTGTT ATTGGTGATC 264 0 

CTTTATATGG TGATTCAACG TTAAGACAAT TAGAACTTGA AAGTTACAAA ATAGAGTTTG 2700 

TGcATCCCTT GACTAAGGAA GTCATTTCCG TTTCTTTGGA TGACTAATTT GATTAGTTTT 2760 

GCATGATATT tAAACATGCA ATACCGCATT GTAACTAAAT CAAGTATCAA CtTAAAcGGA 2 820 

TAGATGGAAA ATTATTAATT TTTTCAGATG TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 2 880 

GATTTTATAT TTTGATATTT TAGTAATTAT TCATTTTATA ACATCCTTGG ATAATGACTT 2 940 

GTAGTCTTTT TCAACTGCGT TACGTGTATC TATGGACAAT ACATGACATC ATAAGATTTT 3000 

TAT CACAGG T TGTTTGGCCA AT ACAT G T AC AACAATTCAT CATATAAAAA ATAGGTTCTA 3 06 0 

TAATAAAACG GACTCCATGA AAAGTTTTTC CTTTTCATGG CTCTATATCA AATCAGACTT 3120 
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CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGGATC TACCATATCT TCTGGTTTAA 3 24 0 

TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTGCAGAT TCTTTTAAAG 33 00 

TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 3 360 

GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 34 2 0 

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 3480 

AAATTGATTG TAGTGTATTA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC 3 54 0 

CTTGTGAACT TGCGAACCAA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 36 00 

TAACATTTCA CATTGTGTAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 3660

TTCAGGGATA G AAATTT CTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 3 72 0 

ATTAGCAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 3 780 

20 TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 3 84 0 

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGATGCGCAT TAATACCAGT 3 900 

ACCAACAGCC GTACCACCGA TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 3 96 0 

25 TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 402 0 

CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 4 080 

TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTTTGTTT CAACCTCTTG 414 0 

ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 420 0 

ATCATCATTk GGGTGGATAC TTTCATCACT TTGATGATCT TTTAAATACA TATTAGCAAC 4 26 0 

ATAACTTACT ACTTCGTTCA CATTCATATT ACTTTGTGTA CCGCTTCCTG TTTGCCATAC 4 320 

AACTAGTGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 4 38 0 

GGCATCTTTC TTTGCCTCGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 444 0 

CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 4 500 

ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 4 560 

TCCAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 4620 

45 TTAACTGTAG TATATCATTA AATAATTTAA TTGAGCAATT T AT G ATT AAA ACGTTTTCAT 4 6 80 

AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 4 74 0 

TACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTACGATA 4 8 00 

CTTTGTTTTA GTAGCTTCAA CCGtAGCAAT AGCTGTAAGT ATATATAATA CAGCACTAAC 4860

AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTGTTA AACTCGCATA 4 92 0 
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TACAATACCT GATTGATTAC TTTTAATGAA TGTTTGCGCA TTAACATCAT CAATTAATCC 504 0 

TTTTGATAAA TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 5100 

AATTGCCAAT GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 516 0 

TTGAATCCAG TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 5220 

GCATATCATA ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAATATTAC 52 80 

TGATAGATGA CTTCTTTCAA TTATGTCTGG AGTAATTAAT TATCAATTCC GTTTAAATGG 534 0 

TGTTTTAATA TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 54 00 

15 TTTTATGCTT TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGGCTT 5460 

CGGAATGTCA GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 5520 

CCCAATTTCT ACAGACATTG CAAATTGGGG AAACGGGCCA CAAACTCAGA AACTGGTGGA 5580 

20 AAGTCAGCTT AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 564 0 

AATTTCTACA GACATTGCAA ATTAGGGAAA CGGGCCACAA ACTCAGAAGT TGGTGGAAAG 5700 

TCAGCTTAAA ATAACATGCA AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 5760 

25 TTCTACAAAC AATGTAAGTT GGGGAAACAG CCCCAACACT GAAACTAGCA GAAAGTCAGC 5820 

TTCTATGAAT ATAATAAAAA AGCTAGGTAA CAAAATGCTA CCTAACTTCA TATTCAAGAT 5880

AATCAATCCT ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 5940 

TATTACACAA TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 6 000 

TGATCAGAGA TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 6060 

ATACTGTTTT CTTCTTCATG ATAATTTAAT TTTGGATCTT TGTCTTTAGA CATAATCAAT 6120 

ACCTCACAGA TTTTTAATTA AGCAAAGCGT GACGTAAAGT AAGCTTTAAC ATCTTCAGGT 6180 

AAACCTGCAG CCGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAAT CG 624 0 

GCTGGTTCGA AGCTTGTTTT ACCATTTTCT TGATATAATT TTTCAACTAC ATCTCGTTTA 63 00 

TTAGCACCTG TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 63 60 

ACATTTAACT TACCTTGCTC ATCGATAGAA ACAACTTGTA ATGTTAATTT CCCTTTATTT 6420 

45 TCTTTAGTTT TAATCTTATC AGCGATTAAT TCGATTGCAT CTTTTTCATA AG CAATTGGA 64 80 

TAAACTTGAC CTGCTGGTAC ACCTAACGCT TCGAAATATG ATTTTTTATC GTCATAATCT 654 0 

AAAATATTTA TTTGGCTAAA ATCAACAGCA TGTTTTT CAA CATTTTTCTT TAATTCATCT 6 600 

50 AGAACTGGCG CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 6 660 

AATTGCTTTC TAATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 6720 
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GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA ACCATTTAAC TTTAACATGC 684 0 

TTATACTCTA AATATAGCAC TTAAGCATCA TTTTTATAAT GAAAATGAGT AAATTTTAAT 6 900 

TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT TAACAACGCA GCAGTATTTG 6 96 0 

ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT TAACGCTGTG TCTTGATATT 702 0 

TCTCTTTCAC CCAGTCTGGT AATCCTGTCG TTTCTTTTCC AAAAATGAAG TAAAAATCTT 7080

TGTCATGATT TGAAAAATCA AAATCACTAT AAGTCTTTTT ACCAAATTTT GTTAATAAGT 714 0 

AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT ATCATGATAC GTAATATTCA 720 0 

CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT ATCATCAGTT CTAAATCCAA 7260 

GAGGTTTAAT T AAATG T AAA TGTGTGTTTG TACCTGCACA CGTACGgCAA TGTTACCAGT 7320 

ATTAGCTGGG ATTTCTGGTT GATATAAAAC GATATOATTT GTCATATTAC TATTCTCTCC 73 8 0 

TTGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC ATCCTCTTGA TCATAATTAG 744 0 

CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTTGACC AATGGCCCAA TAAGCAGTTG 750 0 

CTCGAATCAA CGGTCTTTCA TCTGTTGTTG CAACTTTTTT CAATTCTGGA ATTGCATCCA 75 6 0 

CTT CATT AAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG TATCGGCTTT TTACCACGCC 76 2 0 

AAGCACCTGC AAGgTGACCA T ATG TTTGTT TGAATTCTTT ATTAGACATA CGTAGtAAAG 76 8 0 

GTACTAATCT TGGCTTTAAA ATTTCTGGTT CCAAAATGAT GTCATCTTGT TCGGTATTAA 7 74 0 

TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA ACCATATAAT CTATTCCCAA 7800 

TTTTATAACG ATATTGGTCA GGCATATAGC CTTTTGTTTG CGTTAAAAAA CTAATGCATT 7 86 0 

TCTGACTATT TAATTGGCCA TTTCCAACTA ATGCACTTGT TGGACAACGA TCAACACAAA 7 92 0 

TTGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG GTTCC 7 9 65 

(2) INFORMATION FOR SEQ ID NO: 322:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1302 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322:

GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAATCGT 6 0 

50 GGCAATTATA GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 120 

ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 180 
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TGTAGTTGTT GAAAATATTT AT CGACGCTT AACAGATTCA GAAGAACAAC TAAAAGGTGA 3 00 

AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT CGACACTAGT 360 

5 TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 4 20 

ACCTTTTGCA TTGGCTATTG CATTTAGTTT ATTAGCATCG TTATTAGTGT CAATTACACT 4 80 

CGTTCCAGCG TTGGCAGCTA CACTATTTAA AAAAGGCGTT AAACGTCGTA ATAAACAACA 54 0 

TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GTATTACATT GGTCATTAAA 6 00 

TCATAAGTGG ATT GTAATTA TATTAAGTAC ATTAATTTTG GTTGCAACTA TTGTATTTGG 660 

AGGACCGAGA CTAGGCACTA GCTTTATTTC AGCAGGTGAC GATAAATTTT TAGCTATTAC 720 

TTATACACCG AAGCCTGGTG AAACGGAGCA AGCAGTGTTG AATCATGCGA AAGATGTTGA 7 80 

AAAATATTTA AAACAGAAAA AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 84 0 

TCCAGTAGAT CCAACGGGTA GTACAAATAG TATGGCAATC ATGGTTGAAT ATGATAATGA 900 

CACGCCTAAT TTTGATGTAG AAGCGGATAA GGTTATTAAA CATGCAGATG GCTTTAAACA 96 0 

TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 102 0 

25 TACTGTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT G T AAAAG AT A TTGAACAGAA 108 0 

AATGAAACAG GTTAAAGGAC TAG CCAATGT CAAATCTGAT TTATCGCAAA CATATGATCA 114 0 

GTATGAAATT AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 1200 

30 TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT A CG ACTGTT A AAGAAAATGG 12 6 0 

TAAAACTGTT GATGTTAAAG TCAAACAAAA TAAGCAAACA GC 13 02 
(2) INFORMATION FOR SEQ ID NO: 323:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:

ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 6 0 

TTAAACCACC TTCAACAAGG TTTTGTACTG CTTCATCAGA GAAGAAAGCA ATATGTGGTG 12 0 

TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 180

TAGTCCAGTC ATTTGTGAAG TATGCTGCTT CATTTTCATA AGTATCAATC GCAGCACCTA 240

ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCACCACGTG 3 00 
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GATAGCTTTC TTTGTTCGCT GGAACATGTA AAGAAATAAT ATCGGCATCT TTAATAGCTT 420 

CTTTAACACT ATCTTTATAA GTTAAAAAGT CTAAATCTTT ATTAGGATAG GCGTCATAAG 4 80 

CTGTAATTGT AG CAC CAAAT CCTGCATATA TtTTAGCTGT AGCAGCACCG ATACGACCCG 54 0 

TACCAATAAT TGCAACAGTC ATATTTTTAA CTGGTTTAGA CATGATTTCT GCTTGCCAAG 6 00 

TAAAATCATG TGCTTGTACA CGGCGTTCAA TATCTGGGAA GCGACGCACT AATTGTAGGG 660 

CGATAGArAC AGAATACTCT GCAATtGTTT CAGGTGAATA ACTAGGAACG TTAGATATCA 72 0 

CAATATTGTG TTTTTTAGCT AAATCTAAAT CATACATATC AAATCCAGCA GTACGTTGTG 780 

CAATTTGTTT AATACCGTAA GATTCTAATT TAGGATAAAC GTCATTTTCT AACTTACCAA 840 

ATTGCATTGT AGT7ACGCCA TCGTAATCTT TTAATTGAt C GACTGTAGCA CTTGATAATA 90 0 

GCTCTTTAGA AGTAGTTACT TCGACATTAT TCTtTtTCCC CCAATTTAAT GCCATCTCTT 96 0 

TCTCATAATC ACGCGTaCCA AAGAACATAA TTCTCGTCAT TAT 100 3 
(2) INFORMATION FOR SEQ ID NO: 324:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5030 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:

GG CTTTTAT A GTGTTTACGA TTATTATGAT TACTTATATT CACGCTGTTA AAAATTATCC 6 0 

35 AAATAACCGT ACTGTTCATT ATGGTTATAC AGCTGCATTT ATACTTGTTA TATTACAAGT 12 0 

TATCACAGGT GCATTATCTA TTATGACAAA TGTTAACCTG ATAATCGCAC TTTTCCATGC 180 

ATTATTTATC ACTTATTTAT TTGGTATGAC AACATACTTT ATCATGCTAA TGTTACGATC 24 0 

AGTAAGAAGT GACArGCAAT AACaAAAAAG crrAACCGTAA TTTTAATGGC ACGCCCATTA 3 00 

AAATTACGGT nTTTTATATC AATATTTAAA AtTAAACCtA AGCCATGTAA AAACGAGATT 3 60 

ACACGTCAAT TGTTGTGTAA TCTCGTTTTA Tn TTAATCAT TTTAGTCAGT TGCTTTTTCA 42 0 

ATTTCGATTA ATAAATCGCC TGTCGCTATT GTGTCACCAT TATTTACAGT TACTTGTTTA 4 80 

ATCACACCGT CAAATGGTGC TTGAATTGTT GTTTCCATTT TCATAGCTTC AG T AATTAGC 54 0 

AACGGCTGAT TAGCTTTCAC AGTTTCACCT ACACTAACCT TGACTTCAGT TACTGAACCT 6 00 

GGCATTTGAG CACCGATATG ACTTGGATTA CTCTTATCTG CTTTTGGCTT AACGTTCGCA 66 0 

TTTGTATGCA CATTTTCATC TTTAATGTAA ATACGTCTCG CTTGACCATT CAT CGCATAG 72 0 
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CGTTTACCTT TATCGATTTC GATTTCTACT GTTTCACCAT TACGCATTCC AAAGAAGAAT 840

GTAGGCGTAT CAAGTAACGA TAAGTTTCCG TATTGATTTC TAGTTTGAAT ATATTGTTCA 900

TATACTTTTG GATATAGTAC ATAACTAATA ATATCTTGCT CCGTAACAGG ACCTTGTTGC 960

TCTTCTTCAA GCAACTCACG GACTTTTTCA AAATCAACTG GCTCTAGATA TTCACCTGGA 1020

CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA ATAACCGCTT GTAAATCTTT ATTAAAACCA 1080

TTTACAGGTT GTCCTATTTC ACCTTTGAAG AACGACACTA CTGATTCTGG GAAATCTAAT 1140

TTATAGCCAT CTGTAATCAC GGATTGTTCA TCAAGATCAT TTTGTACCAT ATAAAGTGCC 1200

ATATCACCAA CTACTTTAGA CGATGGTGTT ACTTTTACGA TATCACCAAA TAAGAAATTC 1260

ACTCTGCGAT ACATATCTTT GACTTCATCA AATCTTTCGC CTAAACCTAA ACTTTTAGCT 1320

TGTTGACTTA AA7TCGAATA CTGTCCACCA GGCATTTCAT GTTGATAAAT TTCAGTATTC 1380

GGTGATTTGA TATCACTTTc AAAGTCTGAA TAATAAGTAC GTACAGTTGA CCAATAATGA 1440

CTAAGTGACT CCATACCTTC AATATCAGTT CTAAGGTGGC GTGGGAAGCC ATTTAATGCA 1500

TAATATAACG AATTGGCGCT TGGCTGACTT GTTAAACCAC TCATTGAAGC AACAGCAGTA 1560

TCAATGATAT CGACACCAGC ATCTATTGCT TGTTTGTATG TTAATAAACC ATTACCACTT 1620

GTATCATGAG TGTGAAGATG AATTGGTAAA TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680

TCGTAAGCGG CTTTAGGTTT TAATAAGCCT GCCATATCTT TAATCGCTAA AATATGGAAA 1740

CCTTCACGTT CTAACTCTTT AGCTAGTTTG ACATAATACT CTAAAGTATA AATGTTTGAT 1800

CGCTCAGGAT TTAAAATGTC ACCTGTATAA CAAATAGTAC CTTCTGAGAT TTTGCCCGCT 1860

TCTTGTACTG CTTCATTGGC AACTTTCATT TGATCTACCC AGTTTAATGA ATCGAAAATT 1920

CTAAAGACAT CTATGCCTGC TTTAGCACTT TCTTGTACGA ATTTATGAAT AACATTATCA 1980

GGATAGTTTT TATAACCAAC TGCGTTTGAA GCACGTAACA ACATTTGGAA TAATACATTT 2040

GGAATAGCTT TACGTAGACG TTCAAGTCGT TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100

GCCACATCAA ATGTAGCACC GCCCCACATT TCTAGTGAGA AACCATCTTT AAATACGTCC 2160

GCTGTTTTGG ATGCGATATT AATCATATCC TTAGTTCTAA CTCGTGTAGC TAATAATGAT 2220

TGGTGTGCAT CTCTAAAGGT TGTATCTGTT AGTAAGACAT CATCCTGCTT TTTAACCCAT 2280

TCAGCTACAC CTTTTGGACC TACTTCATCA AGCAATTGTT TCGTACCACT AAATGAAGCG 2340

ATTTTACTTG AAGATACAGT TGGAATTGAT GCTAATTCAT AGTCTGGTTT CGGACGTTTC 2400

TCAACATTTG GGAAACCATT AATTGTTACA TTACCTATAT ATTCTAATGT TTTAGTACCT 2460

CTATCTAGAG ACGGCTGAAT GTCGAAAAGT TCTGGTGTTT CTTCAATAAA TTTAGTTGTG 2520
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TTAACACCAC GAAT A CG CAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTA 264 0 

AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAAGG TGATATCTCA 270 0 

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 2 76 0 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAATACGA 2 82 0 

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 2 880 

CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2 94 0 

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 300 0 

TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTGCATTGA CATATTTAAT ATTTTCCATC 3 06 0 

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATGTTG ATGATAATCC AACTGATGGT 312 0 

GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT 3180 

20 ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 3 24 0 

TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTTCTGA TTTTGCTCTA 33 00 

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 3360 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTGCAAATTC TTTTGCTAAT 34 20 

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 34 80 

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 3 54 0 

AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTGTT CATTTTCACT TAAAAATCCA 3 6 00 

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 3660 

35 TTTAAATAAC TTTCAGCAGG AC CTAAATCA CTTCCAACTA AATAGGATTC ATCTGCTTTA 3 72 0 

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 3780 

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 3 84 0 

40 TTTATTTGTT TCAATAGCGA TACACTCCTC AAACTATTAG AATTTTCTAA CTAATTAGAT 3 90 0 

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTTCmAA TTTTCAAAaT 3 96 0 

TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATGACTA 4 020 

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4 080 

AAAATTAAAA AT AAAATTG C AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 414 0 

TATCATCATG ATT CT AATTT CGCCACCACA TTTAGTAATT ITT AGT CAT A AAATTT AG TT 4 2 00 

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4 2 60 

AACaAAAAGG AAGTTAAGCG TTG1TTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4 3 20 
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TCTCGTACTA AATATTGGCT AGTATTTTTT TAATTAAATT GTCTTCTTAT ATCAACTTTT 4440

TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAGAAGTAAA 4500

CCCATAGCAA TACTTAAACT AATCATTGAT GATCCACCAA AGCTGATAAA TGGCAATGGC 4560

ACACCAGTTA ATGGAATTGT TGCCGAAATA CCGCCAATGT TTACAAACGT TTGACTTCCA 4620

AAGTATGTGG CAATCCCAAC ACACACAAGT TTATAAAAAT ATGATGATGT TTTATTTGCA 4680

AACTGGAAGG CACGATATAC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 4740

AAACCTAATT CTTCGCAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4800

CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4860

AGCAATGAAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 4920

GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4980

ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTTGTT 5030

(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1389 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:

CTTGTTAATC CGAAAATTAT TAGTCAATCA AATGAAACAA TAACAGACTT nGAAGGTTCA 60

ATTACATTGC CAGATGTTTA CGGCGAAGTG ACAAGAAGTA AAATGATAGT TGTCGAAAGT 120

TATGACGTCA ATGGGAACAA AGTTGaACTA ACTGCACATG aAGATGTAGC AAGAATGATT 180

TTGCATATTA TAGATCAAAT GAACGGTaTC CCTTTTACAG AACGTGCGGA CCGTATTTTA 240

ACAGATAAAG AAGTGGAGGC ATATTTTATA AATGACTAAA ATAATATTTA TGGGTACACC 300

AGACTTTTCA ACAACTGTTT TAGAAATGCT TATTGCAGAA CATGATGTCA TTGCAGTCGT 360

AACGCAACCA GATCGACCTG TTGGACGTAA ACGTGTTATG ACACCACCAC CAGTTAAAAA 420

AGTTGCAATG AAATATGATT TACCTGTATA TCAACCTGAA AAATTAAGTG GATCAGAAGA 480

ATTAGAACAA TTGCTTCAAT TAGATGTAGA TTTAATTGTA ACTGCTGCTT TTGGACAATT 540

ATTACCTGAA TCATTGTTGG CATTACCAAA TCTTGGGGCA ATTAATGTAC ATGCATCATT 600

GTTACCGAAG TATAGAGGTG GTGCACCAAT TCATCAGGCA ATTATCGATG GTGAACAAGA 660

AACCGGCATA ACAATTATGT ATATGGTTAA AAAATTAGAT GCGGGTAATA TTATTTCGCA 720
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ATT AGGGGCA GATTTATTAA AAGAAACTTT AC CAT CT ATT ATAGAGGGCA CAAATGAAAG S4 0 

CGTACCTCAA GATGATACGC AAGCAACATT TGCTTCCAAT ATTCGACGCG AAGATGAGCG 900 

AATTAGCTGG AATAAACCAG GAAGACAAGT GTTTAATCAA ATTCGTGGAT TATCACCATG 96 0 

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA ATATACGATG CTGAACTCGT 1020 

TGAGACTAAT AAGATAAACG AGCCTGGAAC CATTATAGAA ACGACTAAAA AAGCCATTAT 1080 

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT ATG CAATT AG CTGGGAAAAA 114 0 

GAGAATGTTA GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAGAAACT 1200 

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT ATTAAATGAA 1260 

GGTGCGTATA GTAACTTGCG TATCAATGAA GTGTTGTCAG AAAATGAATT AAATGCAATG 132 0 

GATAAGGCTT TATTTACAGA AATTGTCTAC GGAACCGTTA AAAGAAAATA TACGTTAGAT 13 80 

TTTTATTTA 13 8 9 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2746 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT 6 0 

AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 120 

GCTAAAGCAT TAGATAAACA AAGTAATGCA AAAACTGGTA AATATGAACT TGGCGAaCTT 180 

TTaAAATTTA TAAATTTAAT GGTGAAGAAG ATAATAGTTA CT ATT AT CCA GTTATAAAAG 24 0 

ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA TAAAGATGAT TTAAACAAAT 300 

CAAAAGAAGA CATGAATTAC AGTGTTAAAA TTTCAAACTT CATCGCTAAA GATTTAGACC 360 

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 420 

aAGAAGATGG cmAAGTTAGA TTAGTAAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 4 80 

AAGAAAGTGC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 54 0 

CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTT CAAT AT GAAAATACAT 6 00 

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660 

TGGCAGCATT ATT AAATG C a ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATGC 72 0 
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AAATGATTGA ATACGGTAAA TCACAAGGCA GAG AT ATT CA TTATCAAGAA GGCGTACCAT 84 0 

CATATGAACA AGTTGATCAA CTTACAAAAG ATAATGTAGG AATTATGATC CTTGCACmAA 90 0 

5 GTGTATCTCA AAACCCTAAT GACCCACATT TAGGACATGC GCTAGCAGTT GTTGGTAATG 9 60 

CTAAAATTAA TGACCAAGAA AAACTTATTT ACTGGAATCC TTGGGATACA GAaTTATCAA 10 20 

TCCAAGATGC AGATTCAAGC CTATTACATT TATCATTCAA TCGTGATTAT AACTGGTATG 10 80 

GTTCAATGAT AGGTTACkAA AAAGTAATAT AGATATTGAT TAAAGGCAGG TAAAACTATG 114 0 

TATCAACTAC AATTTATAAA TTTAGTTTAC GACACAACCA AACTCACACA TCTAGAACAA 12 0 0 

ACCAATATCA ATTTATTCAT TGGTAATTGG AGTAATCATC AATTACAAAA ATCAATTTGT 1260

ATACGTCATG GCGATGATAC AAGTCACAAT CAATATCATA TTCTTTTTAT AGATACGGCA 1320 

CATCAACGCA TTAAATTTTC ATCTATTGAT AATGAAGAAA TCATTTATAT TCTTGATTAT 13 8 0 

20 GATGATACAC AGCATATCCT CATGCAAACG TCATCCAAAC AAGGTATTGG CACTTCGCGA 144 0 

CCAATCGTTT ATGAGCGCTT AGTATAACTA ATTTAAATGA TTTCACTTCA TAAAGCGGGT 150 0 

TGGCGAGAAT TCAATTTCTC ACCAGCTCGT TTTTTCATTG TAATAATAAT CTTTAACATT 156 0 

TATTCTTTCT CTATTAATTT TTCTCAAACT ATCTTATCTT TATGATAATT AATTAAAATG 16 2 0 

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC CACCTATCGT CGCTAATAAA CGACGCAAGT 168 0 

GACTTAATAT CATATTCAAA ATAACTTATG GGAATTTAGG GAATTGATCG AAGTCAGGAT 174 0 

CACGTTTTTC TTTAAACGCA TCACGGCCTT CTTTCGCTTC ATCAG TTGTG TAATAAAGCA 1800 

ATGTTGCATC CCCAGCCATT TGTTGTAAAC CAGCTAAACC ATCTGTGTCA GCATTCATAG 186 0 

35 CTGCTTTAAG GAATCGTAAC GCTGTTGGTG AGTGTTTCAT AATCTCTTTA CACCATTGCA 192 0 

CAGTTTCATC TTCAACTTTC TCTAAAGGTA CCACTGTATT TACTAGACCC AT AT CTAAAG 198 0 

CTTCTTGTGC ATTGTATTGA CGACATAAGT ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2 04 0 

CGATACGTGC TAAATATCCT GAACCATAAC CCGCATCAAA TGAACCTACT TTAGGACCAG 2100 

TTTGTCCAAA AATAGCATTA TCAGCAGCAA TCGTTAAGTC ACAAACAACA TTTAGTACAT 2160 

TACCGCCACC TACAGCATAA CCTTTTACCA TCGCGATAAC CGGTTTTGGA ATAATACGAA 22 2 0 

TTAAACGCTG TAAATCTAAT ACATTTAAGC GAGGGATTTG GTCTTCACCT ACATAACCAC 2280 

CATGTCCACG TTTCTTCTGG TCACCACCAG AACAGAATGC TAAATCACCT TCACCAGTTA 234 0 

ATACGATAAC TGAAACGTTT TGATCATCAC GTGCACGTGA AAATGCGTCA ATCATTTCAG 2400

CAACTGTTTT AGGTGTAAAC GCATTGCGTA CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 24 60 

TCCCTTCGTA AAATT CAT AT TTGATTTCAT CATATTCTCT AAGTGTTTCC CATTGTCTGT 2 52 0 

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TCCACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AGGAATTAAA 2 64 0 

TTTGCCATTT TTTTCGCAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 2 7 00 

GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 2 74 6 



(2) INFORMATION FOR SEQ ID NO: 327:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:

TAATGTTTAG TTTATTAACA GTAAGTTCGT ATATCAATGT TTAGTGCTCC CCAAAATTGA 6 0 

20 AGTTTGAATT TTAAAAG CAT CTTGTAGAAT TTAGTTGTAT TTTTTTCAAA GAAATTCATT 12 0 

TTGATTATTT TTGATAATGA GCATTTTAAT AGTAATACAT GTTTATAGTG TGTAGTATAT 18 0 

GTCTATACTA GTAGTAACTA TATAGAGAAA GTAGGAATAA ACTATGTCAC AAGATGTAAA 24 0 

TGAATTAAGT AAGCAACCAA CGCCAGATAA AGCAGAAGAT AACGCATTTT TCCCATCACC 30 0 

ATATTCCCTT AGTCAATATA CAGCACCTAA AACAGATTTT GATGGTGTTG AACACAAAGG 36 0 

TGCCTATAAA GATGGTAAAT GGAAAGTATT GATGATTGCT GCTGAAGAGC GATATGTATT 42 0 

ATTGGAAAAT GGAAAAATGT TCTCTACGGG TAATCATCCT GTTGAAATGT TATTACCTTT 4 80 

ACATCATTTA ATGGAAGCAG GTTTTGACGT TGATGTTGCG ACATTATCTG GTTATCCAGT 54 0 

35 TAAATTAGAA TTATGGGCTA TGCCAACTGA AGACGAGGCA GTTATAAGTA CTTATAATAA 60 0 

ATTGAAAGAA AAATTAAAAC AGCCAAAAAA ATTAGCAGAT GTGATTAAAA ATGAATTAGG 66 0 

ACCTGATTCA GACTATTTAT CTGTCTTTAT CCCAGGCGGA CATGCTGCAG TTGTTGGTAT 72 0 

TTCTGAAAGT GAGGACGTTC AACAAACATT AGATTGGGCA TTAGACAATG ACCGCTTTAT 780 

AGTTACATTA TGTCATGGAC CAGCAGCACT ACTTTCAGCA GGGCTTAACA GAGAAAAATC 84 0 

TCCATTAGAA GGATACTCTG TTTGTGTCTt CCCTGaCTCA TTAGATGAAG GTGCAAATAT 900 

(2) INFORMATION FOR SEQ ID NO: 328:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3642 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT TTaAGcGtTT 6 0 

CaTCaGCTAA CaCtTCaTTa TCtTTAGt CG cTTTATGcCC tTCt ATTAAA CtCGGaACtA 120 

5 AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA 180 

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 24 0 

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 3 00 

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 3 60 

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACACCAAT AATTAGTCCA AACTTCAATT 4 20 

15 TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 4 80 

TAAACATTGT TTATATTCTT AGATGCACGT ACGTCGTGTT GCGCTCTGTA ATGTTATACA 54 0 

TACACTTATC CTTCATTATA CCCGAACTTT TTATATTAAA AGCAAATTTA TGGAAAATGC 600 

20 AATTAATTGT CTATTATTTT TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAACGC 660 

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TATTATTTAA CTTCGCCTGT 720 

TTTAGGATCG AATTGCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 780 

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 840 

GATATATTTC TTACCGAAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC G AG ATTCAG C 900 

TGTAGCTTCT TTTGGATCTG TAAAGTCTAA TAAATTATCT CCAAATAGCA TGACAAGTTT 96 0 

ATGATCCTTT TGAACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020 

TAAAATATGA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 10 80 

35 CTTTAAATCT TTTTCTTTAT CTCTATCAGA AATATAGTAG ATATCGACAC CTTTTTTGTC 114 0 

AGCATATTTC AAGAATTCTT TTGCGCCATA GACAGGTTTA GCTTTAGCAG cTTGTACCCA 1200 

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 1260 

40 ATTATCTAAA ACTGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 13 2 0 

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 13 80 

TTTTGCTTCA GCTGAATTTT GATACCAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 14 4 0 

CGCCTTTTGT GATGCTGGTA TAGAAGCTTG TTGCGTTTGT TGAACTTCAG CAGAACTTTT 1500 

GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTGAAACT GTAACCGCTA CCGATAATGA 1560 

50 TGCTATTGCA ATATACTTTG AAAT7TTATT CATCTTATCA CCTCATGATT AATATTTAAA 162 0 

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA TACTATGAAA AGATATTTAA 16 80 

TATTATTTCT TGgAAAAGCT AACAAATATG TGAACATTTC ATATAAGCAT GATTTAATGG 174 0 

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GTGACTAGGC CTTCCTATCA G A CAT ATT C A CTCATCCACG TATCATTATG TGTACAGTGT 18 60 

GCTATCTCTT ATTTACCTAT TGGAACAACC ATAAACTCAT CCATAGTTTA CCTTTTATAA 1920 

ATAGCAGTCC TCACTCATAC AATTTCTCAT AAAAATCACA ACGCTCCAAC GTATTTCCAA 1980 

CTTACTTTCA CCTATTTTAA TTCATAAAAA CGACACTTTA ATTGTCATTA TCCAATAATA 2 04 0 

GCAAGACGTT ATTATTGCAA TCTTTTTTAT AAAATAATAG AATCATAGTA TTGTCATTTA 2100 

AAGATAAAGT AAGAA CGTTT TTATTTTTCA GATTTTTTAA ATTATTATGA ATATCTAGTT 2160 

TTAGGAAGGA AATTACATTG AAAAAACAAG TTATTATTTC GGGCCTCATG TTATTTTCAC 2220 

TATTTTTTGG AGCCGGAAAT TTAATATTCC CGCCCATGCT TGGCCATACA GCGGGTCAAA 2280 

ATATGTGGAT TGGTATGCTA GGCTTTGCCC TTACAGGCAT ATnACTCCCC TTTATTACTG 234 0 

TTATTGTTGT TGCATTTTAT GATGAAGGTG TTGAAAGTGT AGGCAATCGT ATA CATC CAT 24 00 

GGTTCGGGTT TATTTTTGCT GTCGTGATTT ACATGTCTAT CGGAGCATTT TACGGTATTC 2460 

CACGTGCTGC AAATGTCGCG TACGAAATTG GTACAAGACA CATTTTACCT GTGCATAACC 2 520 

AATGGACTTT AATTATATTC GCAGCAATCT TTTTTGCCAT CGTTTACTGG ATTAGTTTAA 2580 

ATCCATCGAA AATCGTTGAT AATTTAGGTA AATTATTAAC ACCGTTATTA C T ATT AATGG 264 0 

TCGCTCTATT AAGTATTGCT GTCATTTTCA ACCCTGAATC TGCACTAAGT GCACCTAAGG 27 00 

ATAAATATAT AACACATCCT TTCATTTCAG GAAGTTTGGA AGGCTATTTT ACAATGGATC 27 6 0 

TTGTTGCTGC GTTAG CTTTT TCCGTAGTCA TTGTCAATGG CTATAAGTTT AAAGGCCTCA 2820 

CAGATCGCAT GAAAATTTTA AAATATGTCT G CTTTT CAGG TCTTATTGCA GCCATATTAC 28 80 

TTGGAATGAT TTACTTTGCA CTTGCATACG TTGGGGCATC AACAGCTCCA GGAAACTTTA 2 94 0 

AAGATGGTAC AGATATATTG ACGTACAACT CATTACGATT ATTTGGTTCG TTCGGTAACC 300 0 

TCGTATTTGG AATGACGGTT ATCCTTGCAT GCCTAACAAC ATGTATAGGA CTCGTcAATG 306 0 

tTGCGCCACA TTTACTAAGA AACACGTACC TAAGTTTTCT TATAAAATAT TCGCACTTAT 312 0 

TTTCtCTATC ATAGGGTTCT TATTTACAAC ACTTGGTTTA GAAATGATTT TAAAAATTGC 3180 

TGTCCCATTA TTGACTTTAA TATATCCCGT GTCGATTGCA CTTGTACTCA TATCATTTGC 324 0 

TAACATGTTT AGCACATTCA GATTCAGTTG GGCCTATCGA CTCGCAACTG TTATTACATT 33 00 

GATTATTTCA ATTTTACAAA TACTAAATAG TTTCAACTTA TTACACGGTG TTATTTTGAA 3360 

ATCGTTTATG ATGTTACCTT TAGCAGATAT CGATTTAGCT TGGCTTGTAC CATTCATGCT 3420

CTTTGCTATT ATCGGTTTCA TAATCGATGT ATTTATACGC CGTCCGAAAC AAGCGACAAC 34 8 0 

TTAATAAATG CTCACTGCCT AGTAATGATT GACCCATCGT TACTAgGCTT TTt t ATATGA 354 0 
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TAAAAAATCC TAGCTGTTAT TCAAAAATAT TAGTTTTTAA AA 3 64 2 

(2) INFORMATION FOR SEQ ID NO: 329:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2187 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:
TTATTGATAT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 6 0 

ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GG TAT AT CAT 120 

CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT TAAATATACT AATGATGGTA 180 

AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 240 

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 3 00 

ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 3 60 

ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 4 20 

TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAGATTTC ATTTTCCATA 4 80 

ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 54 0 

TTCAACATTT GCTTCCAATA AGCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 600 

AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 660 

35 CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 720 

TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 7 80 
TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 84 0 

AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 900 
TATAATGATG GTCTTACTTT ATATACTAAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 960 

ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 1020 

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 1080 

TGGGAAACAC AAGACAAAAG CTTTTTCAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 1140 

AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 1200 

ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 126 0 

TGTTAAATAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATTCAATT 132 0 
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TTTAAATTAA TTTTATGTAA TAT AAA TACT GCATTTGCAA ACTGTTGCAC TTTTAGGTAT 144 0 

AACAGAATTA ACTACATTTA AGGAGATTGA TGAACCATGA AAAAGAAAAA AGGTTTTGGT 15 00 

CTTGGTATTA GTTTAATCGC CATCATGTTA ATTGTATGTA TTGTATTAGT AATCATGATG 156 0 

ATGACTGGCG GAAAGAAAGA TACATACTAT GGAATTATGA AAGATAATAC TACTATTGAA 1620 

AAAATGATTA GTGAAAAAGA TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 16 8 0 

GATGTTAAAG TTAAAAAAGG TGATTTTGTA ATTGTTTATA AATTAGCAGA TT CAG AT AAA 174 0 

ATTGTTAAAG TTAAAAAAGT TGACCATGAC GATGTACCAC ATGGTTTAAT GATGAAAATT 1800 

CATGACATGG GCAAAATGCA CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTT 186 0 

GCCATCAAGG TATATACGAG TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 192 0 

TACTGCTGCT TTTTATATTT GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTCGCC 198 0 

TgcgCTTTGC GCATGCATAA AAGCCCCTAA CAACCTGAGG TCACTGCGCT CCGGTTCGCC 2 04 0 

TGcgCTTTAG CGCATGCATA AAAGCCCCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 210 0 

CTGCGCTTTA GCGCATGCAT AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 216 0 

CCTGCGCTTT AnCATGGCCA TAAAAGC 2187 
(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330:

CCnCCTTTTT AAACCTGGnG AAATGtmCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT 60 

GCTTAAGATA GAGTGTAATA CGCTCCTGTT GTTCTTTATA GTATATTGCT TTTTGTTTTT 12 0 

CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 180 

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 24 0 

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGATTAT AAA CC AT AAC 3 00 

ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 360 

AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 4 20 

GATAGCTATT ACATTAAATG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 4 80 

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 54 0 
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GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 660 

TCTAATCCTT GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTATAAAT CGTACTACTT 720 

TTCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 780 

AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 84 0 

ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 900 

CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 96 0 

ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 1020 

TTCGCTACGT TTTGTAATTT TTCTTTATTT CTACCTGCTA AGACAACTTT TGCCCCTTCT 1080

TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 114 0 

ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATT CTTTTTA TTTTTTCAGG 12 0 0 

GTAAATCATA AATAGATATT ACTTTTAAAA AGCGTATCAC AATTCATATA ACGGTCATAA 1260

TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC ATTCGAAACA TACAATTAAT 132 0 

CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTTCTT TATTGTAAAA 13 80 

ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 14 4 0 

GAGGAATAAA AC g TG A CAT A TCATGAGCGT GTTTTAGCAT TAAGAGCAGA AAGTAAAAGA 1500 

ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 1560

CTTGCAGAAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 162 0 

TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 16 80 

CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTTGAAAA TGGAAGGATT CTAATATCAT 1740

TTCGTTTATA TATnGCAGAC CATGGATAGA ATTTTnTATG GTnAATCC 1788 
(2) INFORMATION FOR SEQ ID NO: 331:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TAAGCCAAAT CCATTGTTAC CAGAAGAAGT TCGCTTCATC TTAAAACAAA TGGGTCTTAA 6 0 

AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC 12 0 

ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 180 
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GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 300 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 360 

AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 420 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 4 80 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 54 0 

CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGGAGGT TTAATTATTC CGGTAGAAAT 6 00 

CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCGG CTCCaGTATT 66 0 

ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 720 

TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 78 0 

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 84 0 

TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAGCAGATAA 9 00 

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 960 

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 1020 

GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 1080 

TCAAGAAGCA GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 1140 

ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTCCGTTT 1200 

AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 126 0 

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 1320 

AGCAGCAGGT GCTGACTATG T 1341 
(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 6 0 

ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 12 0 

ACCAGCAACA ATTTTATCGT ATTTTTGCTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 18 0 

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 24 0 
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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTTT CATAATTTTG 360 

GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420 

TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 4 80 

TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 54 0 

CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 600 

AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 66 0 

TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 72 0 

CCAATTTTTC TAACATGATC TAAAATGCTC ATTGCTAATG CAGCACCTTC ACTCGGATCT 780 

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 84 0 

TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 90 0 

TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGGGA 960

ATCAACAATC CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 102 0 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 10 8 0 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 114 0 

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 12 00 

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC AT CTTTGT C C 12 6 0 

GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 132 0 

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 13 8 0 

TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCGATA TTCAGCTTTG 1440

ACAGGTATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CATCTGATAA TTTTTTCTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 1560 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAGATCA GTTAACACAG GTAATTGATT CATCTTGTCA 168 0 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 174 0 

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 180 0 

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 186 0 

AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 1920

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980

GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 204 0 

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TCACGCGATA ATGCGTTAAT CACTCTATCT TTTGTtACAA ATCCTTTTTG CGCAGTTGtA 216 0 

CGC CAT AATT CATAAAATCT AAATGATTTG TATGATGCGC AT CAGTGTT A ATAGTTAATT 222 0 

TCACATTTGG ATATTTACGA ACGATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 2280

TAATTTCTAA TACTGTATTC GTTTCTTCAG CTAATGCCAT TAATTGTTCA ATATTCGGTT 234 0 

TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CGCACGTATG 24 00 

GATTGCGACA TGCATTAGCT AATCGTTCCA TAATTTGTTC TTCTGATTGG TTAAAGCTTT 24 6 0 

GATGAATAGC TCCAATTACA TAATCAAGTT GTGCTAAAAT TTCATCATCA TAATCCAG CG 2 52 0 

AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCA CTATATTCTT 2580

TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAAAGTCT TTCTACTTGT AAGCCATTAG 2 64 0 

CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCGATAT 2 700 

TTGCTTCTAC CATGTCTCGA ATAGAAAACG CACCATCACT ATACGTTGTA TGCATATGAA 2 76 0 

TATCACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 2 820 

CATCTTCTCG CATAGCAGGT GGTATAAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 2 8 80 

CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT TCCATATTCA CTTACTTTTT 2 94 0 

CATCACGTGC TTTAGCAAGT TGTCGAATTC TTATATTATG TTCTTTTGAC CCAGTAAAAT 3 000 

GCTGCAATGT ATGATAAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 3060

CAT CAT CAT A CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCTTTTACTT 3120 

TATTGGGAAT ATTTAATAAT TGCTGCTGCA CTG CTTTTGG GTTATCGGTA CTTATTATGA 3180 

AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 3240

CGATATAATT TAATGTATCT ATATAATCAA TGATTTCTTG ATTAAGTCTT CTCATTTGAT 3 3 00 

CAATTGGATA TCTATCTTTC TTAGCAC CAA GTTGTTTCAC AGCTTCTAAT ATGTTTTGTT 3360 

CCGTTTTCTT AGCAAATCCG CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 342 0 

GTGACGCTTT ATCAACAATA TTCAACTCTT TAT AT AG CTT AGCAATTTTC TTGCTTC CAA 34 80 

GTCCTTGAAT TTTCAAAAGT GGAATAAGAC CTTCCGGAAC TTCTTCCTGT AATTGCTGTA 3 54 0 

AATACTGAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA 3600 

TGCCTTTt AA CTCCGTtACA TCAGATATTT CATCTAATGG TCGTTCATCT AATT CAAGAC 3 66 0 

TTTGAGCTGC TTTTCGaTAC GCTGaTATTT TAAAAGTATT TTCCCCTTTT AATTCCATAT 3720

AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGTCATAATA ACACTCCATA 3 780 

AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 3 84 0 

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TAGATATTTT AAGCTGATGT TGTAATGCTT CGTTAGGATA TAATGCCAAT AGATATAACG 3 96 0 

TAAAGTGTAA GACAATTATC GTCATAAACA CACCAACTAT CATTCCCATT GCACGACTGA 4020 

AAATATGAAT GTTTTGATAC GCTATTATTT TATCAAAAGT TACGATAATT AGATATAAAA 4 080 

TGAACTTACA AAACAATGTA ATCATTAAAA AAGCTACAAT CGCTTCAAAT CGATTTTGTA 414 0 

GATGATTAAA ATGAAACGCA AAAGTTGTAT TAAATGCTGT TGTTTTAGGA TATGGAATAA 4 200 

ATACAATTAA TCTTTCTACA ATAGATTTGT AAAATTGACT GGCAATCCAC AATGATACAA 4 260 

nCGTTGCACT CAAATGTATC ATAGATAACC AAAAAC CTCG TCTGAATCCA ACGATGACAA 4 3 20 

AATACACAAA GAAAATGATT ATGATAAAAT CAATGACCAT TTATTGCTCA CGCTGCTGCA 4380

ATTTGTGAAT TTGTTGTTTC AAACGTCGAT TTTTTTTCTT CTAGTAGTAC TTTTTCATGC 4 44 0 

ATAATATTCA CAGCAGTTAG TATTGCTTTT CTTGAAGTAT CTAAACCTGC TGCTTTATAC 4 500 

CCTAATTCTT TTATTTTATC ATCAACTAAA TGTGCTACAT ATCGTATGTG CTCTGGGTTA 4 560 

TCTTCCCCAA CAATTGTAAA AAGCTGATCA TTAATTGATA CATTTACCTT GTTTTTAAAC 4 620 

TGTGTCATTT ATAATTTCTC CTGATCCTTT TTTTAAAATC TAAATTCACG TTATAAAATA 4 6 80 

TGACTGGATA GTTTGTCTGA ATTTGATACT AATATTGTTA TATTGCAATT ATGATAAAAC 4 74 0 

AACAACACAA TCTCTATAGA TGACTTAATG TTCTTTTTAT AATGAAATAA TGTAAAGAAT 4 8 00 

TTTCTATTCA ATACTTTATC ATGTTTAAAT TGTGTCACTA TAACATTTTC ATAAACATTA 4860

TACATGACCA CTATGTATTT TGTAAGTATC CGCAATTAAT TCTTTACAAC ATACATAAAT 4 920 

GTTTCTGACG TTATTATCAT TTATGATATG ATTATTTTTG CTAAAGACAA TGAAATTTTA 4 9 80 

TGAAAGGATT TACACAAATG GCGAATATCG TTTTTAAATT GTCGGATAAA GACATAACGA 5040

CATTAATGTC ACGCATTTCT TTTGATACTG AGAATTTACC TCAAGGnATG AnAGCACGTG 5100 

CAAAGTATCC AAAATACAAC TGTAAATATT TACCAT 5136 

(2) INFORMATION FOR SEQ ID NO: 333:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4239 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:

GGCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 6 0 

ATGACAAATT AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 120 

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GGTATCTTTC AAAGATAAAA TCTTAATAAT TTCTTAGTAA ACTCTTTTCT CTAGATTTAT 240

CACAATATTA TATAGACCTA TTTTATTTTG ACGTAAGTTG CTAGTATCTT CAAACAAAAA 300

ACCTTTATAA AAATTCATAC CTTTATGCTA TCGCTGTAGG CTCATTAACT TGTTACATAT 360

AATTCTTAAC TATCCTTTGA TGATTGTTTT ATTAGATTGT TTCGTTGATG GATACTTTCA 420

CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCCCAA AACTTTAATT TGTTTTGGAC 480

TGTTTTATAA TTATGCTTGC GATGGTGTTT TAGTTTCTGA AGTTTGTTCA GCAATGTCAT 540

ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTCAATT TGATTACCAT 600

CTAATATTAA TTCACTTAAA TTATCTTCGA TAGTTTTTTG TATCGCTCTA ATTAATGGTC 660

TTGCACCATA TTCTGGATCA TATCCTTCTT CTGCGATTTT GTCTTTCGCT TTATCAGTTA 720

CAATAATATT TATGTTTTGT TCAGATAATC GATTTGTTAA TTTATTAACC ATCATTGTTA 780

CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAC AATGATATCA TCTACACGGT 840

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 900

CATAATCTTG TCCATCACTT GAACCACCGA ATCCAGCAAA TCGTTGATCT TGTAATTCTT 960

GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 1020

TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTGGAT 1080

GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 1140

TTAATTGTCC ACCATCATCA TGACCAACAT ATCCTGGAGG AGCACCAACT AATCGGCTCA 1200

CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCGCATCA TCATCGCCAA 1260

ACATTGATTC AGCTAAAGCT CTAGCTAATT CAGTTTTACC AACACCAGTT GGTCCAAGGA 1320

AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TGCACGGGCA CGTCTAACCG 1380

CTTTACTGAT TGAATTAACA GCATCTTTTT GCCCAATAAC TCTCTCATGT AATGTATCTT 1440

CTAGACTAAG AAGTTTTTCA GATTCTGTTT CATTGATTTT AGTTAATGGG ATACCTGTCC 1500

ATCCTGCAAT AACTTCAGCA ATATCTTCTT CTGACAATGA AGTTGaCATG CCATTTTGTG 1560

CATTCTTCCA TTCATTTTTA GCTTCTTCAT ATTGCTTTTC AAGTTTTGTT TGTTTATCAC 1620

GCAGgTTAGC AGCATTTTCA AACTCTTGAG CATGTACTGC GGCATCTTTT TCATTTTTAA 1680

CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 1740

TTACTTTAGA ACTTGCTTCA TCAATTAAAT CAATTGCTTT ATCTGGTAAG AAACGATCTG 1800

AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATTAATAC 1860

GATGGTGTGC TTCGTAACGA TCTCTTAATC CTTTTAAAAT AGCAACTGTA TCTACTACTG 1920
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TTTTGCGATA TTCATCTAAT GTAGTAGCAC CAATACATTG TAATTCACCA CGTGCTAATG 2 04 0 

CCGGCTTCAA AATATTCGAA GCATCGATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 2100 

TATGCAACTC AT CAAT AAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 216 0 

TTTTCAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 222 0 

CTAAAGACAT AACACGCTTA TCTTTTAATG TCTCTGGTAC CTCATTATTC ACTAtGGCtT 2 2 80 

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 234 0 

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 24 00 

TAACAGGATC TAATGTACCG TCTTTGGCAA TGACTGTTAA GTCACGAGCT AAACTATCTA 24 6 0 

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 2 52 0 

CTAAAGCTTT CACAACTTGT GCACGTGCTT TAGTAATATT TAAATCTAGA TTTGCAAAAA 2 580 

CTCTTGCTGC AACACCTTCA TTTTCACGAA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2 64 0 

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 2 700 

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 276 0 

CTTCAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2 82 0 

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TATATTTGAA TGATTTAAAC 2880

GAATTGCTTC TTCTTGGGCA TGTGCTAATA cGCGCTGTGC ACGCTCAGTT AATCTACCAA 2 94 0 

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3 0 00 

TTTACAGATT TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 3 06 0 

TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120 

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180 

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 324 0 

GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3 3 00 

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGTCCATAA ACTTGCGAAC 3 360 

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTG 34 2 0 

TCATTCTTTT CATAATAGAT AGACCTGGCA AATGTAGCAT CACGCTTGCT CTCATACCAG 34 80 

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TT CATCATAA CTTATATCAA 354 0 

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 3 600 

TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 3 66 0 

ATTCATCGTC ATTCACTAAT ACTGCGGCTG CTGGTTGTTT TATTAGTTCA GGACTAATCA 3 72 0 
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CAAAGTTTGG CAAGG CATCT TGTACCTCAT TTATAACTCT AAATCCcATC ATTTTCAGTA 3 84 0 

GCATACATTA GTGGATGCAC ATGATTTTCT AAAtTACGCG CTAACCGAAT TCTAGAAGAC 3 900 

ATAACAATTG GTGTTTCTTC ATTACTTTTC ATCCATTGGC TGATATTATC ATGAATATTA 3960

TGCGTCATCA TGTTGCACCT CACTCTCAGC TTTTAGTGCT TTAATTTCAT CTCTAACAAT 4 020 

GGCTGCTTCC TCAAAATCTT GGATTTCAAT AAGTTTTTTC AAATATTCAT TCTTTTCTTC 4 0 80 

GATTTTTCGC TTTAAAGCTA TCTTTTTATG TGAAGAATGT GGTGTCTTTC CAACGTGCTC 414 0 

AAATTGTCCA CCTTGAACTC TGCGGACGAT ATCAATGATG TCATCTTTAA ATGTTGcATA 42 00 

ACAATTAGCA CACCCAAATT TACCAACATG TGCAATATC 4239

(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CAAAACTTTG ATTATGTCGT GTCAGTTATT AATGCATACA GTAATGCATC GTTGTTATTT 6 0 

TTTATTTATT TACTATTTAT TATGTACCAA ATCAGCCTTt CAGTAAAATA GGTCAAATTA 120 

CTGATTTTCT AAATATAAAA TGCCTCCTAA TAACATACTA TT AGTA CATC ATTAAGAGGC 180 

TCTTGTGTTA TTTGCATACT AAGCGCTCAA ATTTAAATTT AAGATGAAGA TTCTTGCAAT 24 0 

AATATTTCTA TATTTGTCGA TATATCCAAT GGATCTTCCA TTGGCAAGAA TCGATTTACA 300

ACATTTCCTT GTCGATCGAT TATAAATTTT GTGAAATTCC ATTTTATTGG GGaCCCAAAG 3 60 

ATTCCTGGTT GTTCaTTCTT TAAATGCGtA AATAACGGAT GTTCATCGTT CCCGTTCACA 4 20 

GATATTTTAG CTAACACTGG AAATGTTACA CCAAATTTCT CACGACTAAT TTTCAAGATT 480

TCTTCATTAG AACCTGGTTG TCGATTGTCA AAATTATTAT TCGGAAAACT CAACACTACA 54 0 

AACCCACGAT CCTTATATTT TTGAAATAAA GTCTCTAGTT TTTTCAATTG TTCGCTATAT 600 

ATACATTCTG TTGCAGTATT AACAACTAAA ATCACTTTAC CTCTAAATGC TTCTAATTTA 66 0 

TAAGTTAAGC CTTTATAATC ACTTACTTCG ATATCATACA CATTTCTATT ATTCATAAGA 720 

CACCCCTACA CAGCCTTTTT TATATTGAAT ATGTTCTTTT TAGAATGTTC TGATAAAATA 780 

AGTGCGCGTT TACACCGTGA ACACACATTA TATAGCGTGA TACATTTTTC GAGCACACGA 840

TAAATAATGT TCGAGTTTAT GTTGTTGCTC AACCTATCCG ATTTACCGTC TTTTTTCACC 900 

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CCGTTATAAC CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020 

GCTCTTTAAA CTATAGaTTA ATACTTAAGk TAAACTCATA CATACTGATA CCATACGTTA 1080 

GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 114 0 

CCTTAAAAAC TTCACATGTG ATTGTkTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200 

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 124 5 



(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335:

CTTGTAAAGG TAGTTGTTCA TTTAAATTAA AACAGTATGC TAAGTTnGTG CTTATATTTT 6 0 

GCAACTTCAA TTCGACAGGC TCATCTTGAT ACATGAATGC CTCAATTTCA TCATGTGATA 120 

ACTTTTTCAA AATATCAATT AAATG r ATAC TAAAAACGAT AAATAACATA TCCCAATTTG 180 

AATTCAGCCC TAGCGATTTT AATTTGTTTA AAATCTTATC TTTTTGAAAA ATTCGATTCC 24 0 

TAATGTCTTG TATATCGTTA TCAGTCAAAG TTTCCCAATC TATATGTGAA TGAAGACCTA 30 0 

AATAACACTT ATCCATTAAT AATTCATATA CCGTTAATGC AGAGACATCG AAACAACGTT 36 0 

CTTCACTTAr AAAAACGCCA TCAACATCAA ATAAAATTTT CTTCACAATC CCCACTCCAT 42 0 

TTCTGaAAAT TCAGaTATAA ATCATTCTAC TATTTGACTA AAAAAAGCGC AAACCCTATT 480

GAAGTaGATT TGCGCTTTAG CTGTTnAAAT TTTATAAATG TnTTTCAATT TCATCAGCAA 54 0 

CCTGCTGTAC GTGTGTACCG ACAATAACTT GAGTTGAATG TTTGCCATTA ACAGTAACAC 600 

CAACTGCACC GGCGTTTTTA ATCTTCTGTT TATCAATAAT AGATGTGTCT TTTAACTCTA 660

GACGCAACCT TGTTGCACAA TTGGTTAAAT TAACAATATT CTCTTGACCG CCTAAACCTT 72 0 

CTAATATTTG TATAGCATGT TGATGATATT TACTTTGTTT AATATCATTT TCACCAGGAG 780 

CAATATTATC TTTTACAACT GTTGGGTCAA CTAATTCATT TTCACCTCTA CCAATCG TAT 84 0 

TCAAGTTAAA TACTTGGATT ACTACACGGA AAATCACATA GTATAAGATG AAAAATACAA 900

CACCTTGAAC AAGCAACATC AATGGATGAT TTGATACTGG ATTAATTAGT GATAACACAT 96 0 

AATCTATCAA ACCTGCACTA AATGAAAATC CAGCTGTCCA ATGGAATGTA GCTGCGATAA 102 0 

ATAAAGATAA TCCTGTTAAT AACGCATGAA CAACATATAA GATTGGCACA ACAAACATAA 10 80 

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ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG 1200 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 1260 

CTTTTATAGC ATCTTTGCCA C7TTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 1320 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 1380

CACCATATAT GAAAGCTCC 13 99 



(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

TATAGTTnTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 6 0 

CGTATTAAAT TGAATGGTAG GACCAATCAA AG TAT ATT CA ATCGCTGGAC CATCATTGTT 120 

AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 180

AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 24 0 

AATCTTAATT G ACATTTT CA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 300 

TATCTCTTTC GATTTGTATA AATTTCTGTT CATCTATTGC ATAAAATTGT ATCCATTCTC 36 0 

CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 420 

CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGTTTGA TTATTCGCAA 4 80 

TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 54 0 

GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 600 

AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA TGCTTTGCTA 660

CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 72 0 

TGGTTTGATT CACAGCATGA ACATTTTTTT CATTAAATTT GTTAAGTTCT AAATTTTCAA 78 0 

TTAATTTAGA AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 840

GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 90 0 

CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AATCATAATT GTCTGCTCGT 96 0 

TAATAAATCT TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 102 0 

CCTTTTCATT GTATCATTGA GAAG CACAAG TTGTTTAATA AGTAATTCAA ATCGCATATA 10 80 

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TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCGGTCTG ATGGAAAATT 1200

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAGCACGA CCGAAGTATC 1260

ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 1320

ATTTTTACG 1329



(2) INFORMATION FOR SEQ ID NO: 337:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60 

TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA AT AG AAT CAT 12 0 

GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATGATAG AGAACCAAAA 180 

TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT GACATTAATA ACAGTACACA 24 0 

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 3 00 

AGCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC 360 

AAATCCTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT TAAATTCAGT 420 

ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT 48 0 

GATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTTTGACCAG CACCTAAGAA 54 0 

TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC 6 00 

CCTTTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 66 0 

TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 720 

CTTTGGTTTT TAT AAT CAT A TATAATAACC AATTGAAAAT TTAATTCTAT TGTAAAATTC 78 0 

ATGGATTATT CACATCTTGA AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 84 0 

TTACATAGAT TTACAAAAAA ATTGTTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 90 0 

GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 96 0 

ATTtATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AGACTAAAAT 102 0 

AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCTGA 108 0 

AAATGCGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 1140 



1209 



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TACACATTTT GAGGAGTATG TTACAGATCA CGTGCATCCA ATGACAGCAT TACGTACGTC 1260

ATTATCATAT ATTGCACATT TCGATCCTGA TGCTGAAAAT GAATCAGATG AAAATCGTTA 1320

TGAAAGAGCA ATGCGTATAC AGGCTAAAGT AGCATCATTA GTTACAGCGT TTGCTCGAGT 1380

AAGACAAGAT AAAGAACCAC TTAAGCCTAA TCCTGACTTA AGTTATGCGG CAAACTTCCT 1440

ATATATGTTA CGTGGGGAAT TACCAACAGA TATAGAAGTA GAAGCCTTCA ATAAAGCACT 1500

TATTTTACAC GCTGATCATG AGTTGAACGC ATCTGCATTT ACGGCACGTT GTGCGGTATC 1560

ATCATTGTCA GATATGTACT CAGGTATTGT AGCAGCCGTA GtTCTCTGAA AGGGCCATTA 1620

CATGGTGGTG CAAACGAACA AGTTATGACG ATGTTATCTG aGATTGGGTC AaTTGAAAAT 1680

GTTGATGCTT ACTTAGATGA AAAATTTGCT AATAAAGrTA AAGTAATGGG cTTCGGTCAT 1740

CGTGTATATA AAGATGGTGm tCCTAGaGCG AAaTATTTaA GaGAAaTGAG CCGTCAAaTT 1800

mCGAAAGACG CTGGTCGTGA AGAATTATTT GAAaTGTCAG TGAAAaTGGA AAAmCGTATG 1860

GCAGAAGAAA AAGGATTAAT TCCTAATGTT GATTTTTATA GTGCGAGTGT TTATCACTGT 1920

ATGGAAATAC CTCATGACTT ATTCACGCCA ATCTTTGCTG TAAGTCGTTC TGCAGGATGG 1980

ATTGCTCATA TTTTAGAACA ATATAAAGAT AATAGAATTA TGCGTCCTAG AGCGAAATAT 2040

ATTGGCGAAA CGAATCGTAA GTATATCCCG CTTGrAGaAA GAAAmTAATC AATACAAATT 2100

AAAAATGAAG ATGTAAAATT TGGAGGTAAA ATAACTATGA CTGCAGAAAA AATTACTCAA 2160

GGAACTGAAG GATTAAACGT ACCTAATGAA CCAATTATCC CATTTATTAT CGGTGATGGA 2220

ATTGGACCGG ATATTTGGAA GGCAGCAAGC CGAGTTATAG ATGCTGCtGT TGAGAAAsCC 2280

TATAATGGCG AAAAACGCaT TGAATGGAAA GAAGTGCTAG CTGGCCAAAA AGCATTTGAT 2340

ACAACTGGTG AATGGTTACC TCAAGAAACA CTTGATACAA TTAAAGAATA TTTAATTGCT 2400

GTTAAAGGAC CTTTAACAAC ACCAATTGGT GGTGGTATTA GATCATTAAA TGTGGCTTTA 2460

CGCCAAGAAT TAGATTTATT TACTTGCTTA AGACCGGTAC GTTGGTTTAA AGGAGTACCA 2520

TCACCTGTTA AACGTCCACA AGATGTTGAT ATGGTTATTT TCCGTGAAAA TACTGAAGAC 2580

ATTTATGCTG GTATTGAATT TAAAGAAGGT ACAACAGAAG TTAAAAAGGT AATTGACTTC 2640

TTACAAAACG AAATGGGTGC GACAAACATT CGATTCCCAG AAACTTCAGG TATTGGTATT 2700

AAACCAGTTT CTAAAGAAGG AACTGAGCGA TTAGTTAGAG CAGCTATACA ATATGCTATC 2760

GATAATAACC GTAAATCAGT TACTTTAGTT CATAAAGGTA ATATTATGAA ATTTACAGAA 2820

GGCTCATTTA AGCAGTGGGG TTACGATTTA GCATTATCTG AATTTGGTGA TCAAGTATTC 2880

ACTTGGCAAC AATATGACGA AATTGTTGAA AATGAAGGCA GAGATGCTGC TAATGCTGCT 2940
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TTACAACAAA TTTTAACTCG TCCAGCTGAG CATGATGTTG TAGCAACTAT GAACTTGAAT 3 060 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

GCAAACATTA ATTATGAAAC AGGTCATGCT ATTTTTGAAG CAACACATGG TACAGCTCCA 3180

AAA T AT GC AG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 324 0 

TTAGAACATT TAGGATGGCA AGAAGCGGCA GATAAGATTA CAGATTCAAT TGAAGATACA 3 3 00 

ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 33 60 

TCTACATCAG CATTTGCAGA TGAATTGATT GnAAATTTAA AATAAGCAGA ATAGAATTAG 34 20 

G 3421 

(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:

CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60 

TTTCCAATAA ATAAnCCAAC TGCAATllGCT ATaACTCTAA ATATAATAGA GATAATCmcA 120 

TTAATTGGAT TAAtACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 18 0 

AGTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 24 0 

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 300 

CAATGACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 360 

TAAACAACTT GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAA7ACAA 420 

TCGTCCACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 480

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGGAC 540 

ACGCAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACTAGCAAT TGCTAAGGCA 600 

TCAGTCACTC TTTCATTAAG ATCACCTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 660

GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 72 0 

ATAGCTCCTG TATAAACCGT TTCATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 780 

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTATCATC CTCAATAATT 84 0 

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900 

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ATTTTGTCAA ACTCACAAAT CGTTTCAGCA CCACTAATAT GAACATCTTG ATTGCTAGAA 1020 

CGCATTTTTA TACTATACAT GACGATCACC TcAATCTTCT TGaTGCAAAA TTTCAAACAA 1080 

CCTATCTATA TCTTGTTCAG TATGAAAATA CGACAATGAT ATTCTTAACA TTGGCTTAGT 1140

CACAGTtGGA TACCTTAAAT AACTTGTAAA CACATGATGC TTTAATAATG TTTGATGAAT 12 00 

GTTCTCAGCC GCTTCTATGT CATCAAACTC AATAAACTTA ATCGGCGAGT TTGaACTATT 12 60 

ATAATkAACA TTGAGTGCTT TTAACTTTTG GTTAAAATAT TTACTCAAAC TATTTAATTT 1320 

AGTGCGTCTA TCATCAGCAT TTATTAACTT TTCAATGTTT CTTTTTATAA AATACAAATT 13 80 

ATAAATTGGC AAACTACTTG AGTAGATGAG TGGTCTACCG TGATTAATTA aCATATCCtT 14 4 0 

CaCATCaTTT GaACTkaAAA TcACACCCCC GTATGCACCA CATGCTTTAG ATAAACTAGA 1500 

AGTGAGTATA TCTACACCTT GATAATTCGA GTAAtTCTCT ATtCCAAAAC TATGTGAAAC 156 0 

ATCGAGTATC AGTGTTGCGT TAnATTTATG CTTTAATGAG ACTAATTGAC CAATATCCAC 1620

AACGTCG CCA TTCGTTGAAA ATACACTATC AGATATGATT ATTTTTGGTA TATTTTGATT 16 80 

AGGGTATTTC TCTAACCTTT TTTCTAAATC AGCAATATCT AAATGCTTAT ATATCACTTT 174 0 

TTCTAAACCA CTTAACTTAA TACCGTCAAT AATACTCGCA TGATTTTCTT GATCTGAAAA 1800

CACGACACAA TTTGTATTTT TGAAAATATT AAATAACGCC AAA TT AG CAT CATAACCACT 186 0 

ATTTAAGATA GTACATGcAC TATATCCGAG CCAACCTGCT AACATTGTTT CAATTTCTTC 1920 

ATAAGCTGTC GAACTTCCAC TAATTAATCT TGAACTTGAT AAGTGATAAC TATACTTCCG 1980 

CATAAATCTT TCGAAATCAT CCTTATCAAA CGCTATTTGA CCTAATCCTA AATAATCATT 2 04 0 

AGATGTATAG TTCGTACATC TCTTATTTTC TACTTCAATA TACTGTCTAT CTATATACCC 2100 

TACCGATTTA AGCGACCGAT ATAACCCTTT CTGTTGTAAT AAATCAATTT GCTCTTGAAA 216 0 

CTTCATTCTT GTTTTCCTTA TTTTCACAAG TGTCATAATC AATTTCAAAG CCTAAATCAT 22 2 0 

TAATCATATC GTAGTCTAAT TGGTTCGGTT GCCCACCAGT AATTAGATAA TnCACCGACA 2280

AATATTGAAT TCGCCGCTTT TAATGCTAAT GGCTGTAACG AACGTAAGTT GACCTCTCTT 234 0 

CCTCCAGCAA TACGAATTTC TTTCGTAGGA TTGATTAATC GG AATAATGC TACGATTCTT 24 00 

AAACATTTCA TTGGTGTTAA ATCATCCATG CTTCCAAACT TTGTGCCTTT GATTGGATGC 2460

AAAAAATTAA TCGGAATACT GTCGGCATCC ATTTCTTTTA AAGCAAATGC CATATCAACA 2 520 

ATATCTTGAT TAGATTCTCC CATACCACAA ATCACGCCAG AACATGGTGA TATATTATTC 2 580 

GCTTTCATTA GTTCTATCGT ATCTGTTCTA TCTTTATAAC TATGCGTTGT CACGACGTTA 2 64 0 

TCATGGTAAT TTTCACTTGT ATTAATATTG TGGTTATATC TGTCTACACC AGCTGACTTA 2 700 

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TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 2 820 

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC GACCTTTGCT 2 880 

CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2 94 0 

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT TAAAATCATG 300 0 

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3 06 0 

TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 312 0 

tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 3173 
(2) INFORMATION FOR SEQ ID NO: 339:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339:

CGATTATCCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 6 0 

CAAAAGCCTA AATAAATTAC AGCAATTTAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120 

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAAA TGCATTTAGA 180 

GATAATCaAA ATGCGGAAAC ATCTCCaATA ATCAATAATC TATTCCCAAT AAATATGAAT 24 0 

GTTCTCAACA ATACATTATT TATATCTCTT TACACTGTCA TCGACAAAAA CTAAATCTTT 3 00 

CACTTTCAAT TTCGAACGTG GTTCTACGAC ATTTGCTGCT ATATCATTTA ATGGGATTAA 360

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AAT CTATCAT 4 20 

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 4 80 

AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTGAG 540

TGTTGTTTsA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 6 00 

TGCTGTTTCA TAAATCGGAG AAATAtTAGA TACGTTAATA CCATCATATT CATTCaAAAT 660 

CTTyATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 72 0 

TGCTTGAATC ATyTATTCTC CCTCACTATT TCGATACCTA CTCCATCATA ATGACCCGGA 780 

ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA TTACACGATT ATATTGTGAA 84 0 

TTTATACGAT TTGCAATACG TTCAGCTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 900 

ATAATTGATT TAACCTCTTC GAACACTTCA CCATAATGAA CTGTATCAAT AACATTATCA 96 0 

ATTTCATTTT CAGCTGATAA AGCACCATGA TATCCATAAA AGCGCATACC TTTAAGAAAG 1080

ATTGTGTCTT GCATTTTCAT TCTCCTTTAA AAAATCTATA CCTTTAGCTA ATTTAGCATT 114 0 

CAACTCGACA TTATGAACGC GTACTGCTCT AACGCCTTTC ATAATACCAT ATGCAGTCGT 1200 

AG CTGCAGTT ACTTCATCTC TTTCAACCGG TGTTGTATCA TAACCCATCA TCTCTTTAGT 1260 

GAAACGTTTC CGGCTTGTCG CTAATAAAAC TGGATATTCT GTTGCAACAA GTTCATCCAG 1320 

TCTTGCCATA ACTTCGGCTT CTTCATTTCT AGTTTTAGCG AAACCTATAC CTGGATCTAG 1380 

CCAAATTTTA TTTGAAGGTA TACCAGCTAT TTTAGCTTGA TGTGCTTGTG CTAACAAAGA 14 4 0 

TGTTAACATT TCTTCGACAA CCGGTTCATC ACGATTACCA TTTCCATTAT GCATTAAAAT 150 0 

AATTTCCGCG TCATAtTTAG CTACAATTTG GAACATACGA TGATCATACA GACCgCCcAT 1560 

tGATCATTAA TCATATCAAC GGCTAATTTT AAACATGCTT CAGCAACCTC ACTTCGAAAT 1620 

GTATCGACTG AAATTTTTTA CATCAnAACC GACAATAGCT TCAnCAACAG TAATACTCTG 16 80 

TTCCATCTCT TCTG 16 94 
(2) INFORMATION FOR SEQ ID NO: 340:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1358 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340:

AGCATTTCTT TTCTATAAAC ATTTAATTGA ACATTATTAA GTACACTATT ACTATAGTCA 60 

CTATATTGAA CaCATACCTC ATTTAATTCT AATAGCGGTT CAGATTTGTA CTTATTATCA 12 0 

TTATTTGCAG ATGTTTCATC TAT C CATTTT TTCACTTTAA ATTTAACATG TTCACTCATA 180 

CAAACGTCAC GTAAATTCGC TAAGTTATCA ATGGATTCGA CATCTACTTC TGCATATTTA 24 0 

AGCGCTGTAC AGTATAATGG TTCACGTATG CCTGCTTCTT TAAGCTTAGA TGATTTTAGC 3 00 

AAATCACTAG GCGTTGTATT AGCGATGATT TTTCCATCTT TAAAAAGAAG AACTCTATCA 360 

AACGTATCAT CTAATGATTC TTCTAATCGA TGTTCGACAA TAATCATCGT TGACTTTGTT 420

TCTTCATGAA TATTGTTTAA CAATCTCAGC GTTTCATGTC CTGTCGCAGG ATCTAAATTG 4 80 

GCCAGTGGCT CATCCAATAT TAAAATAGGC GTACGATGGA TTAATATACC ACCTAATGAA 54 0 

ACGCGTTGTT TTTGACCTCC AGATAAATCT TGCGGTCGGT GATTTAAATG TTCTATCATG 60 0 

CCAACTTTTT CAGCCCAATA ACTTACATTT TT CTT CAT AT CATCTTGTTC AACACAATTA 66 0 



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TCTTGTAAAA CTGTACCAAC AA CAT TAG AT CTATCATGTA AACAACTAAC GGTTGCATCT 780 

TGATTATTTA TATATAGTTC CCCAGTTATG TTACCTTTAG TTTTAAATGG AATTAATCCG 84 0 

TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACGCGAAG CACCAACTAC TAATACTTTT 900

TCTCCTGGAT AAATATCAAC ATTTATATTC TGTAATGTAG GTGTTGCTTG ACTATGATAT 960 

TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA TGATATATCA TTACCTTTCT 102 0 

ATATTCATTT ACATATCTGA TTCAACAAAA TAACTATTCC TTACGTAAAC TACCTTTTTT 1080

AATTTGAGAT GAaGCATATG CTTTTAATAA TATTGTCGCA ATAATGCCAA CTGAAATAAT 114 0 

ATTTAATACT GCAGAGATAA CACCTTGTGT ATAAACCTTG TTAGCTGGTT CGTTATAAAT 1200

CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA ATATTCGCAA TAATTTGACC 1260 

GATATTAAAA TAAACCATCG ATTTCCTAGA AAATCGGCCT GAAGAAAGAT TTAATTTTAG 13 2 0 

TCCAATCCAG CCATATAAAC AGCCTATAAT TCCCGAGC 1358

(2) INFORMATION FOR SEQ ID NO: 341:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4557 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341:

TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGTTAT CATGATTTAG 6 0 

AGTTTCTATT TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA CGTTGGTTGC 12 0 

CACATATGAC ATTGAGAnGG CAAAACATTA GAGGATTTGT TTACAATCAA CGAACGCGTG 180

ACCAAATTTT AACGTCAATT TATAGCATGA TTAAAGAACG TATCCAAGCT GTGCtGaACG 24 0 

CAGCAGAAGT AATGAGCAAA TTATTTTCAC ACCGCAATTA GTGTTTGTCA TTACAGATAT 300

GTCATTAATT ATTGATCATG TCATTTTAGA ATATGTAAAC CAAGATTTAT CAGAATATGG 360 

TATTTCATTA ATCTTTGTTG AAGATGtGAT TGaAAGTTTG CCAGAGCATG TAGATACCAT 42 0 

TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG AATTAGTTCA 480

ATTGAAATTT ACACCTGAAA ATATTGrTAA CGTCGATAAA GAATATATCG CGCGACGTTT 54 0 

GGCGAATTTG ATACACGTCG AACATTTGAA AAATGCAATT CCTGATAGTA TTACATTTTT 600 

AGAGATGTAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT GGAGACAAAA 660 

CGAAACATAC AAAACGATGG CAGTACCTTT AGGTGTAAGA GGTAAAGATG ATATTTTATC 7 20 
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AGGGAAATCT GAGATTATCC AATCATACAT TTTATCTTTA GCTATTAATT TTCACCCTCA 84 0 

TGAAGTTGCA TTCCTATTGA TTGACTATAA AGGTGGGGGT ATGGCGAACT TATTTAAAGA 900 

TTTAGTCCAT TTAGTTGGTA CGATTACAAA CTTAGATGGC GATGAAGCGA TGCGTGCCTT 960

AACATCAATC AAAGCCGAAT TGAGAAAACG TCAACGTTTA TTCGGAGAGC ATGATGTTAA 1020 

CCATATTAAT CAATACCATA AGTTATTTAA AGAAGGTATT GCGACAGAAC CAATGCCACA 1080 

TTTATTCATT ATTTCCGATG AGTTTGCCGA ATTAAAATCA GAACAACCTG ATTTTATGAA 114 0 

AGAACTTGTA TCAACGGCAC GTATTGGACG TTCGTTAGGT ATTCATTTAA TACTTGCGAC 1200 

ACAAAAACCA TCGGGTGTTG TTGaTGACCA AATTTGGTCT AACTCTAAAT TTAAGTTGGC 1260 

ATT AAAAG T A CAAGATAGAC AAGACAGTAA TGAAATTTTA AAAACACCAG ATGCAGCAGA 13 20 

CATTACmTTA CCaGgTCGTG CGTATTTACA AGTTGGTAaT AATGAmATTT ATG AATT AT t 13 80 

CCAATCTGCA TGGAGTGGTG CAACATATGA CATCGAAGGC GATAAATTAG AAGTTGAAGA 1440

TAAGACGATT TACATGATTA ATGACTATGG TCAACTTCAA GCAATCAACA AAGACTTGAG 15 00 

TGGACTTGAA GATGAAGAAA CGAAAGAAAA TCAAACTGAG TTAGAAGCGG TCATAGATCA 156 0 

TATCGAATCT ATTACAACAC GATTAGAAAT CGAAGAAGTT AAGCGTCCAT GGCTACCACC 1620

ATTGCCAGAA AATGTATATC ArGAAGATTT AGTAGAAACa GATTTCAGAA AATTATGGTC 1680 

AG ATG ATG C A AAAGAAGTGG AATTAACATT AGGACTTAAA GACGTACCAG AAGAACAATA 174 0 

TCAAGG AC CG ATGGTATTGC AATTGAAAAA AGCTGGGCAC ATCGCGTTAA TCGGAAGTCC 18 00 

AGGATATGGT AGAACAACGT TCTTACACAA CATTATTTTC GATGTTGCAA GACACCATCG 1860

TCCTGATCAA GCACACATGT ACTTGTTCGA TTTCGGTACC AATGG TTTG A TGCCAGTTAC 192 0 

AGACATACCA CATGTCGCTG ATTACTTTAC AGTAGATCAA GAAGACAAGA TTGCTAAGGC 1980

GATACGTATA TTTAATGATG AAATTG AT CG TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 204 0 

CACTAGTATT TCTGAATATC GAAAATTAAC TGGTGAAACA ATTCCGCATG TCTTTATTCT 2100 

TATTGATAAC TTTGACGCAG TAAAAGATTC ACCTTTCCAA GAAGTTTTTG AAAATATGAT 2160 

GATTAAAATG ACGCGTGAAG GGCTAGCATT AGACATGCAA GTAACCTTAA CTGCTTCAAG 2220 

AGCTAACGCT ATGAAAACAC CAATGTACAT TAATATGAAA ACGCGTATCG CAATGTTTTT 2280

ATATGATAAA TCAGAGGTGT CGAACGTAGT AGGACAGCAA AAATTTGCGG TTAAAGATGT 2 34 0 

TGTGGGTCGA GCATTGTTAA GTAGTGATGA CAACGTATCA TTCCATATTG GCCAACCATT 24 00 

TAAACATGAT GAGACCAAAT CATATAATGA TCAAATTAAT GATGAAGTAT CGGCGATGAC 2460

AGAATTTTAT AAAGGTGAAc ACCAAATGAT ATtCCTATGA TGCCAGATGA AATTAAATAT 2 520 
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GGATTAGATT ATGAAGGTGT TACACTACAA AAAATTAAAT TAACTGAACC AGCAATGATT 2 64 0 

TCATCAGAAA ATCCGAGAGA AATTGCGCAT ATTGCTGAAA TTATGATGAA AGAAATTGAC 2 70 0 

ATATTAAATG AAAAATATGC GATTTGTATC GCAGACTCAA GTGGAGAGTT TAAAGCTTAT 276 0 

AGGCATCAAG TGGCTAACTT TGCCGAAGAA AGAGAAGACA TTAAAGCGAT TCATCAACTA 2 82 0 

ATGATTGAAG ACTTAAAGCA AAGAGAAATG GACGGCCCAT TTGAAAAAGA TTCACTTTAT 2880 

ATTATCAATG ATTTTAAAAC ATTTATTGAT TGCACGTATA TTCCGGAAGA TGATGTTAAA 2 94 0 

AAGCTTATTA CAAAAGGACC AGAACTTGGC TTGAACATTT TATTTGTCGG CATTCATAAA 3 00 0 

GAATTAAtAG ATGCTTATGA TAAACAGATT GATGTTGCAC GTAAAATGAT TAACCAATTT 3 06 0 

AG T ATAGGT A TTCGTATTTC AGACCAACAA TTCTTTAAAT TTAGATTTAT TCAACGAGAA 312 0 

CCTGTTATTA AAGAAAATGA AGCATATATG GTCGCAAACC AAGCTTATCA AAAGATTAGA 3180 

TGGTTTAAAT AGCAATGAAT TAAATAGGAG GGAGGTATGT TATGAATTTT AATGATATTG 324 0 

AAACAATGGT TAAGTCGAAA TTTAAAGATA TTAAAAAGCA TGCTGAAGAG ATTGCGCATG 3 3 00 

AAATTGAAGT TCGTTCTGGA TATTTAAGAA AAGCTGAACA ATATAAGCGA TTAGAATTTA 3 3 60 

ATTTGAGTTT TGCACTAGAT GATATTGAAA GCACAGCAAA GGACGTACAA ACTGCAAAAT 3420

CTAGTGCTAA TAAGGACAGT GTAACTGTTA AGGGAAAGGC GCCCAATACG TTATATATTG 34 80 

AAAAAAGAAA TTTGATGAAA CAAAAGCTTG AAATGTTGGG TGAAGATATC GATAAAAATA 3 54 0 

AAGAATCCCT CCAAAAAGCT AAGGAAATTG CTGGCGAAAA GGCAAGTGAA TATTTTAATA 3600 

AAGCAATGAA TTAATATTGA GGTGAAGATA TGGGTGGATA TAAAGGTATT AAAGCAGATG 3 6 60 

GTGGCAAGGT TGATCAAGCG AAACAATTAG CGGCAAAAAC AGCTAAAGAT ATTGAAGCAT 3720 

GTCAAAAGCA AACGCAACAG CTCGCTGAGT ATATCGAAGG TAGTGATTGG GAAGGACAGT 37 80 

TCGCCAATAA GGTGAAAGAT GTGTTACTCA TTATGGCAAA GTTTCAAGAA GAATTAGTAC 3 840 

AACCGATGGC TGACCATCAA AAAGCAATTG ATAACTTAAG TCAAAATCTA GCGAAATACG 3 900 

ATACATTATC AATTAAGCAA GGG CTTGATA GGGTGAACCC ATGATGAAAG ATGTTAAGCG 3 960 

AATAGATTAT TTTTCTTACG AAGAATTAAC AATTTTAGGT GGTAGTAAAT TGCCTCTCGT 4 020 

AAATTTTGAA TTGTTTGATC CATCAAATTT TGAAGAAGCT AAAGCTGCTT TAATTGAAAA 4080

GGAATTAGTA ACAGAGAATG ACAAGTTAAC TGATGCAGGT TTTAAAGTGG CTACATTAGT 414 0 

CAGAGAGTAT ATTAGCGCCA TTGTAAATAT TCGAATTAAT GATATGTATT TTGCACCATT 4 2 00 

TAGCTATGAA AAAGATGAAT ATATTTTGTT AAGCCGGTTT AAAAATAATG GGTTTCAAAT 4 260 

ACGAATTATC AATAAAGACA TTGCATGGTG GTCGATTGTA CAATCATATC CTTTATTGAT 4 320 
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CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA GAAATTGAAA TATACAATCA 444 0 

TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TGTTATTCAT 4 50 0 

TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 4 557 
(2) INFORMATION FOR SEQ ID NO: 342:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3931 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342:

TTGAGTGACT TTATTGAAGC GCGTGTAGAA GAAATATTCT TCGAAGTATT TGATGTTTTA 60 

CAAGATTTAG GATTAACAAA AGTAAATGGT GGGTTTATTG TAACTGGTGG ATCTGCAAAC 120

TTACTTGGCG TAAAAGAATT ATTATCAGAT ATGGTAAGTG AAAAAGTTAG AATTCACACG 180 

C CATC AC AAA TGGGAATTAG AAAACCTGAA TTTTCTTCAG CAATTTCTAC AATTTCTAGT 24 0 

AGTATCGCTT TTGATGAGTT ATTAGATTAT GTTACAATTA ATTATCATGa TAATGAAGAA 300

ACTGAAGAAG ATGTTATTGA TGTGAAAGAC AAAGATAACG AATCTAAATT AGGCGGaTTT 3 60 

G a TTGGTTT A AACGTAAAAC AAACAAAAAA GATACTCATG aAAATGAAGT AGAGTCAACA 4 20 

GATGAAGAAA TTTATCAATC AGAAGATAAT CATCAGGAAC ATAAACAGAA TCATGaACAT 48 0 

GTTCAAGACA AAGATAAAGA TAAAGAAGAA AGTAAATTCA AAAAACTAAT GAAATCTCTA 54 0 

TTTGAATGAT TATTGGCCAA TAAAACTAGG AGGAAATTTA AATGTTAGAA TTTGAACAAG 6 00 

GATTTAATCA TTTAG CG ACT TTAAAGGTCA TTGGTGTAGG TGGTGGCGGT AACAACGCCG 6 60 

TAAACCGAAT GATTGACCAC GGAATGAATA ATGTTGAATT TATCGCTATC AACACAGACG 72 0 

GTCAAGCTTT AAACTTATCT AAAGCTGAAT CTAAAATCCA AATCGGTGAA AAATTAACAC 780

GTGGTTTAGG AGCAGGAGCT AATCCTGAAA TCGGTAAAAA AGCTGCAGAG GAATCTCGTG 84 0 

AACAAATTGA AGATGCAATC CAAGGTGCAG ACATGGTATT TGTTACTTCT GGTATGGGTG 900 

GCGGAACTGG TACTGGTGCA GCACCAGTCG TTGCTAAAAT TGCAAAAGAA ATGGGCGCAT 960

TAACTGTTGG TGTTGTAACT CGTCCATTTA GTTTTGAAGG ACGTAAACGT CAAACTCAAG 102 0 

CTGCTGCTGG AGTAGAAGCT ATGAAAGCTG CAGTAGATAC ATTAATCGTT ATACCAAATG 1080 

ACCGTTTATT AGATATCGTT GACAAATCTA CGCCAATGAT GGAAGCATTT AAAGAAGCTG 114 0 

ACAACGTGTT ACGCCAAGGT GTACAAGGTA TCTCAGACTT AATCGCTGTT TCTGGTGAAG 12 00 
GTATTGGTGT TTCTTCTGGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 1320

CTCCATTACT TGAAACATCT ATCGTTGGTG CACAAGGTGT GCTTATGAAT ATTACTGGTG 1380

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 1440

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GATGAGATTG 1500

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 1560

GTAGCACTGG ATTCGGAACA AGCGTAAATA CTTCTAGCAA TGCAACTTCT AAAGATGAAT 1620

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 1680

CAACTAAAGA AGATGATATT CCTAGCTTCA TTAGAAATAG AGAAGAAAGA CGTTCAAGAA 1740

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT TCAACACAAA TCATCAGATA 1800

ACATATCTGA TGATTTTTTT ACTAATTTTT AGaACATGTA GAAGGACATT TAAGTTTTTC 1860

aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 1920

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTTGTAC AGTTAGACGA 1980

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 2040

CATCATTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160

CGCCCATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTTGATAGA 2220

AAAAATTGGG TGTTTCCCAT TCAAACACAT GAAAATAAAG TCGCTTGTAT TACAAAGGAT 2280

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 2340

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATGCAG ACTGTGTACC AGTATATTTT 2400

TATAGTACAA AACATCATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 2460

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2520

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 2580

ACATTGCCAA TTGATAGTGC CAACTATATT GAAACTAGAG GACGAGATCG TCATGGTATT 2640

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 2700

TATACGACAG CGTATGCTAC ATCTGAACAT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 2760

AAAGGTCAAA CAGGACGCAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2820

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2880

AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2940

GAGCTAAAGA AGCGTATGAG GCTGGAATAA GACATTTTGG TGAGAATAGA TTGGAAGGCT 3000

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AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATCGAT 3120 

TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 3180 

TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTGCTTT AGAAGATGTT GAT CAGTTT A 3 24 0 

TAGATGATCT TAAAAAATAT GACAAAATCG AAATTGTAGG TTTAATGACG ATGGCACCAT 3300 

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 3360 

AAATACAACG ACTCAATTTA GAATATGCGC CTTGTGATGA ATTAT CAATG GGAATGAGTA 3 42 0 

ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTGGG ACTAAACTTG 34 80 

TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 3 54 0 

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 3 600 

AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 3 660 

TGCATCAAGA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 3 72 0 

AAATAATTCA CGTAATGTTG TAACTATGAA CAATG CTACA CCAAACAATG CATCACAAGA 3 7 80 

AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTGCTGA 3 84 0 

TGAGC7TAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3 900 

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 3 931 

(2) INFORMATION FOR SEQ ID NO: 343:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3150 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:

AATTGTCGGG GGACTCTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 60 

CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT 12 0 

TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 24 0 

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 3 00 

TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 360 

GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 420 

TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 4 80 
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AGGTATTAGA TATTACCTTT TCGAAAAACG ATACCATCCA TTTATTGCAG CAATTGTAAT 600 

TGGACTTATC GCACTCTTAG CTTGGCCAAT GAGTGCATCA ACTGGAAGAA ATGACGGTTT 660 

AGGTATAACA ACGCCTTCAG CAAATTTAGT ACACTTTTTG ATTACAGGTG AAACTAAATT 720 

TATTGATTGG GGTGTCTTTT TAGTTCTAGG AATTTTCATT GGTTCATATA TTGCAGCTAG 7 80 

AGGATCAAGA GAATTTAAAT GGCGATTGCC AGACAAGATT ACAATACGAA ACAGTGCCAT 84 0 

TGGTGGCATA TGTATGGGAT TTGGTGCGTC AGTTGCTGGT GGTTGTTCTA TCGGTAACGG 900 

TTTGGTTGAA ACGGCAACGA TGACTTGGCA AGGATGGATT GCGCTAgCAT gCGATGATAG 96 0 

TTGGTGTATG GACAATGAGT CATTTTATCT TTGTTCGTCC AATGAAAAAA GTACACCAAC 1020 

AATCTGCAAA GGTTAAACAG CAAACGCAAA TAGTATAGAA GATTATTATG CAAATGATGT 1080 

TGATCAAATA AAAGTGATTG GAAAAGGAGA AATAATTATG ATACACGAAT TAGGTACAGT 1140 

AGGAATGGTA TGTCCATTTC CGTTAATTGA AGCGCAAAAG AAAATGGCAA CATTGCAATC 1200

TGGAGATGAA TTAAAAATTG ATTTTGATTG CACGCAAGCG ACGGAAGCCA TTCCAAATTG 1260 

GGCTGCAGAA AATGGTTATC CTGTAACAAA CTATGAACAA ATTGATAATG CTTCATGGAC 132 0 

AATTACAATT CAAAAAGTTT AACGTTATCA TTTTAACAAT AAAATAGATA TTAGATTCTA 1380

TGGCTACTTC CG CTAATTT A AAAGTGAGTA AGTAGTCTTT TTTTTTTTAG TTCATGAAAT 14 4 0 

CATTTTTATA TAGTGTGGCA CATTTTATTC CAAAAGATGT AATAAAACTT AACG CAT TTT 150 0 

TGCTTTTTAT AAATTGTCAG ATTATTATGA AAAAAAGGGA GTGGTAAGTA TGAATCTTAA 156 0 

CGATACGATA TTTATGTTTT TGTGTACATT ATTAGTTTGG TTAATGACAC CAGGATTAAG 162 0 

TTTATTTTAT GGTGGGTTAG TTCAATCTAA AAATGCGCTT AATACTGTCA TGCAAAGTAT 16 8 0 

GGCAGCAATT GTGCTTGTTA CATTTGTATG GATAACAGTT GGTTTTACAA TTAGTTTTGG 174 0 

GAATGGGAAT TTATGGTTCG GAAATTGGGA ATATACTTTT CTTAATCATG TAGGTTTTGC 18 0 0 

GACTCAAGAA GATATTAGCC CACATATTCC TTTCGCTTTG TTTATGTTAT TTCAAATGAT 18 6 0 

GTTTTGTACG ATTGCAATTT CTATTTTATC TGGTTCAATC GCTGAGAAAA TGAAGTTTAT 192 0 

TCCTTATTTA TTATTCGTAG TAATATGGAC TGCTCTTGTA TACAGTCCAG TAGCACATTG 1980 

GGTTTGGGGC GGCGGTTGGA TTAACAAACT CGGTGTATTA GATTTCGCTG GAGGTACGGT 2 04 0 

TGTTCATATT ACATCAGGTG TTTCTGGTTT AGTATTAGCT ATTATGATTG GAAAAGGAAA 2100 

CAAACATTCT GAATCAACAC CACATAATCT TATCATTACG TTGATTGGCG GTATATTCGT 2160 

GTGGATTGGT TGGTATGGAT TTAATGTAGG TAGTGCTTTT AC ATTTG A T A ATATTGCGAT 222 0 

GCTTGCATTT ACAAATACTG TCATTTCAGC CAGTGCAGGT GCTATAGGTT GGTTAATTTT 22 8 0 



EP0 786 519 A2 



ATTAGTTGTC ATTACTCCTG CAGCAGGATA TGTAACATAT CTTAGTGCAA CAATAATGGC 24 00 

TTTAATAGGA GGTATCTGTT GTTATATTGT CATTAATTAC ATCAAGGTAA AACTAAAATA 24 6 0 

TCATGATGCA TTAGATGCAT TTGGTATTCA TGGTGTTGGT GGTATTATTG GTGCTGTTTT 2 52 0 

AACAGCAGTT TTCCAAAGTA AAAAAGCCAA TCCTGACATT GAGAATGGCT TTATTTATAC 2580 

TGGTGACATA CAT ATT AT AC TTGTACAAAT ATTATGTGTA ACAGCAGTTG TAATTTTTAG 264 0 

TATAGTCATG ACGTTTATTA TTGCGAAAGT AATTAAATTa ATTACACCAT TATCTGTTAC 2700 

GGAACAAGAA ACGAATATAG GATTAGACAA GATTGTTCaC GGTGAACATG CTTACTTTGA 276 0 

AGGTGAGCTA AATAGATTCA ATAAACATAT TCGATATTAG AATATATTTA CATAGAATAT 2820

TCATTGTCCT GACATTTAAC TAAAGGTTGA TGTTGGGACA TTTTGTTATA CAAAAGTTTT 2880 

ATTTTGAAAT CTTTTTATGA AAGAAGCAGA AATATTATTT AAAGCGGTTA CACATATGCT 2 94 0 

AAAATAAGGC TAAGTGTCAC AAATAATGAT AGGTGAATAA GTATGAAAAA TATATCTGAT 3000

ATTGCCAAAT TGGCAGGCGT TTCAAAAAGT ACAGTATCTA GATTTTTAAA TAATGGATCT 3 060 

GTCAGTAAAA AAACAAGTGA AAAATTAACA AGAATTATAG CAGAACATGA CTATCAACCG 3120 

AATCAATTTG CTCAAAGTTT AAGAGCGAGA 3150 
(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3719 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344:
GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 60 
ATCATGTTAA AGAAGCACAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTGCTGATG 120

CTAGGATGAA AGCCAAACAA AAAGTTAACA CATTAAGCAA ACCGCATCAA AACTATTTCA 180 
ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 24 0 

TATACATCTC GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATTATAAATA ATCTTTTCAA 3 00 

ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 36 0 

GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 420 
TAAAATATAG AAAATACATA AAAGTAAGTA TAG TTATTTT ATTATAATTA TTAAATTTTT 48 0 

ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 54 0 
ATCGTTTCAA TATTACTTAT AGGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACGCA 660

AAGAGGTTTT TCTATTTCGC TACTAGTTGT TTAGTGTTAa CTTTAGTTGT AGTTTCAAGT 720

CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 780

GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 840

ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 900

AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 960

GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 1020

GAGTTTGACA AAGGTCAAAG AACTGATAAA TATGGACGTG GCTTAGCGTA TATTTATGCT 1080

GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 1140

TATAAACCTA ACAATACACA TGAACAACTT TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 1200

GAGAAATTAA ATATTTGGAG CGAAGACAAC GCTGATTCAG GTCAATAATG CTCATTGTAA 1260

AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTTAGA TCACGATATG ATTTATTATC 1320

AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 1380

AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATGTG TATTATCACA 1440

GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 1500

AGAGCGTGAA GAAAAATGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 1560

TGTTACAAAA TATTTAGTGC AAAAGCACGA CGGAGGTATT CAATATGaAT AACGGTACAG 1620

TTAAATGGTT TAATGCAGAA AAAGGTTTTG GTTTCATCGa AAGAGAAGAT GGTAGCGACG 1680

TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 1740

AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGCAAAC GTAGTTAAAA 1800

TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTTTTTTAT GCTTTAAATC 1860

GATAACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 1920

GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 1980

TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 2040

TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 2100

TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 2160

AATCAACTTT TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTTT 2220

TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 2280

ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 2340

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ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT TAACAATAGA CAATTGCTTT 24 6 0 

TCCGCTTGTT GTATTAAAAA AGGTTTTGTA GATTGATTAT TAATATGCCA TTCACTCATG 2 52 0 

TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT TATAGTTGAG ACATTCATGT 2580

TCAACCAAAA TTTTGTTCGA ATTCAATAAA TGTCTTGTTT AAAATAGAAA TATTGTAAAT 2 64 0 

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA ATTAAAATTT GGCAGTTAGT 27 00 

TAAGAAGTCT TGATAATCAC G ATCG CAAAA ATAGTTTTCA CGTGCATCTT TAGCATCGCC 2 76 0 

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA CGTTCAATAT ATAATTTGTA 2 820 

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA TTCAAAATAT TGGGCCTCCT 2 88 0 

GAAATATCAT TTGTAATCTA TACC CAATTT ATTGCAAAAC AAAAACTAAT TTAACTATTT 2 94 0 

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTTTG TATGGATCTA TAAAATTATC 3 00 0 

TTTAATTGCA TAGGGTGAAA TAATATGTAG TCCATAACTT TTAACTGATT TTTCACTTAC 3060

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC GTAAATTTTT TGCTGTTCAA 312 0 

ATTTAATGTA A CTAG AT AAC GATGATTTGT ATT CTCATAG TTTTTCTTAA CCCATTCAGC 3180 

CGCTTTTTTA CCTGCACCAG GATAGCTGCA ACGATAAACT TTCATCCAAT CATTTTTGCC 3240

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA GTTGGTTTGT CGCCAGGCCC 33 00 

CTCAATTTGT AAAATCGTTT TATCATCAAT CGCGATACTA CAATGACCAA AAAATCscCA 3360 

CATGACAGGG CCTTTTGTAA CAATAATATC ACCAGGTTGT AATTGGAATT TGTCATCTTG 34 2 0 

AATTTCTGAA TACTTATTAT CTGCAATTGT TTTTGGTGAG TTTATTGGGG ATACGACAAC 34 80 

GAATAATATA AGTAAAATTA TCGTTCGTTT AATATAGTTC ACTTAAAAGC TCCTTGTTGA 3 54 0 

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT TTAATAAGCT TGtAAGACTA 3 600 

AAACATATCT TAAATATTAA AGTATGAGAG TGTGAAATGT CTATTAAGAA TnAAAAACAG 3660 

TCTGAAACAT CATTGAGACG TTCCAGACTG GATATAAAAT GAATTTCATT TATAGCACA 3719 

(2) INFORMATION FOR SEQ ID NO: 345:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1676 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:

TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 60 

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CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC 180 

ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACGGGA ATTCCCAATT 24 0 

CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT CTTTTTGGTA AAAACTTCCC 300

ATTTGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 3 60 

TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAATA TATTGAAAAT AAATCTTTTT 4 20 

ATTTTCCTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 4 80 

ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AGCTGATGCA ATTTGTCTGT 54 0 

CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 600

TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TTCGATATAT ACGAAGGTCT 660 

TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 72 0 

ATTATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGTCTTGA 780

TAAACGTCTT GATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 84 0 

ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 900 

ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 96 0 

TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA TAACTGCTTC ACCAACTTCT 102 0 

TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT TAATGAATTC ATTCACACCG 1080 

TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 114 0 

TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 1200 

CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 126 0 

TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG CTACACCATT AAAATAACTA 132 0 

ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 13 8 0 

CCGTAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT CAATCGTTTT ATAAGTTTCA 144 0 

TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA TCACACCAGC CTTGACATAA 1500 

GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 1560 

TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAAGAAC ACTTGCAGTT 16 2 0 

CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 1676 
(2) INFORMATION FOR SEQ ID NO: 346:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1294 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

TACAGTAGGA ATCATAAAAC CTAATACAAC AAATACAAAA CCATTTAAGG CATAACTAAA 60

TGTGTTCCAA ATTTGATGGT AATTCATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 120

GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATGAAC 180

TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAATAAAAG TTAAGGTATT 240

GTTATCTTTT AAACCTTTAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 300

AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG ATAAATTGTT CAACAGCTTG 360

GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATGATACC 420

TfiPTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCTT TAGGTAATAA 480

           GTAATAGCAG ATACTGCTAC TGCATCAGTA GGACATAAAA TTGCTGCTAT 540

           GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 600

           GCAAACACTA GTGCCATTGA CATTAATAGT ATAGGTTTAC GATATTCTAA 660

TAATTTTGTT CGAGAGACGT GGGTACCTTC CACAAAAAGT AGTGGCGCGA TAACGGCAAA 720

CATAAATACT TCAGAATTGA ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 780

A f*PT A ATG C A ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT TGATAACCGA 840

ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTGCCA TACAATACTC 900

CTCAAAATTT TAATAGTTAA TATTTTATCA CTTTTAAGGC ATAAICjALAI 960

ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 1020

TGTTTAAATT ATTGATTGTC ATATCTGAAA TGACCGCTGT CATTTTGTCG TTGTTCATAC 1080

GCGAGCTTTT CAGCATTCGT TTTGTATTTT TtATAAAAGa AAAATAaAAA TATnAACCaG 1140

AATGGCGAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 1200

AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 1260

CTGTTTTTAT GTGCCTCAGG ATGCTTTTTC AAAT 1294



(2) INFORMATION FOR SEQ ID NO: 347:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1935 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



ACATGATAAT GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 60 

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 120 

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 180 

TAAAAATAAT GTGTGTACAA AGTCATTAAT TT AG CAAAT A TTTTTATTTA GTAGTTAATA 240 

ACCATCGATT TGAAATTTAT ATATAATTAT TAG CTAAAT A ATATCCTGCA TCTTTCTCAT 300 

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 3 60 

AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTGCATTTTA AATTATGTTT 4 20 

AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAGCTTAATG 480 

ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 54 0 

GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 600 

ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 660

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 720 

GATAATTGGT CATATCCCCC TTTT CAACTG ACAGAAAAAG ATGG C AAATT ATATGGCCGA 78 0 

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 84 0 

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 900 

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGACGATGT CGATGGCTTA 96 0 

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 102 0 

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080 

ATTGATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATTCAGA GCTTAAACAA 1140

CAAGATACTA AACATGAATT AG ATGTTG CG CCTATGTTCA AATCATTGAT TGGAAAAGAA 12 0 0 

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 1260 

GGCGGcAAAC AATTTAACTC TGTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 1320 

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 1380 

GATAGCAATA AGCTTTCACT CGATATTCCA AGCAATCACC GACCTGTAAC AAGCGATAAA 144 0 

AAT AG CAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500 

ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 1560 

GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 1620

GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 16 80 

CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGGCAGT 174 0 
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TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGATGT AAAACTGAAA TTATTGTGCT 186 0 

GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA ATAGTnACAT 192 0 

AAACTCGTAA AAnCC 1935 



(2) INFORMATION FOR SEQ ID NO: 348:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348:

CCTTTnCCTA AACAATTTTT AGATTTAGAC AACAAACCGA TTTTAATCCA TACATTAGAA 60 

AnATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG CGACGCCACn ACAATGGATG 120

ACGCATACGA AAGATACACT TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180 

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 24 0 

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 3 00 

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 360 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 4 20 

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 4 80 

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 54 0 

ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 600

ACAACACCTT A C G ATTT AAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 66 0 

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 72 0 

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780

AAGATATTAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 84 0 

AATTCATGAA GGTGTTGGTG AGGTCGTATT TGACAGTAAA GGTGTGTTTA ATAAAGGTAC 900 

45 

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 96 0 

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 102 0 

50 GCTAAATCAT GATAGAGCTG TACCACTACC TGATGATATT GATTTAAGTA TTATTTCATA 10 80 

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 114 0 

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 120 0 

55 
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10 






GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 1320

ATTTGATCAT GCATTTGAGT GTGTGGGTGG T 1351

(2) INFORMATION FOR SEQ ID NO: 349:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 60

GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 120

ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 180

TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 240

GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 300

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 360

ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1639 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

TCATTTTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTcCTTATT 60 

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACGTAT GTAAGAATGG 120 

CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 180 

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 240

TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 300

GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 360

ATGAGCCCGT TTCATTAGAG ACATTTGCAG AAATGTATGA TGAACCTAAA GGCTTTGGTG 420

CATTATTGGC ATCAATGTAC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAaTCCAATk 480
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TTAATAATAA AGGAGCGTTA TAGTGAATAT CATCTCAACA ATTtTAATCA TATTTGTGGC 6 00 

ATTAGAGTTT TTCTATATTA TGTACCTTGA AACGATTGCT ACAACTTCCA AAAAGACTAG 660

CGAGACATTT AATATAAGCG TCGATAAATT GAAAGACAAA AATATTAACC TACTTTTGAA 720

GAACCAAGGC GTATATAACG GTTTAATCGG AGTTTTGCTA ATATACGGTT TGTTTATCAG 780

CAGTAATCCA AAAGAAATAT GCGCAGCTAT TTTAGTGTAT ATCATTGGCG TTGCTATTTA 840

TGGTGGCCTT TCAAGCAATA TTAGTATCTT TTTCAAACAA GGCACATTGC CAGTATTGGC 900

ACTCATATCA ATGCTTTGGT AAGTATTGGT GTTTGGGGGG GTGGAGATGT AGTCGGAGGT 960

TTGGAGGATT TGAGCGAATT GTGTGTGGAC TTTAGACTCA GAGTATTTCA TCCTAATTAT 1020

TTCAAGCAGA GGTGACAGTA GCGTTGCCTC TGTTTCCTTA TAAAAAAATT ATTTAATGAA 1080

GAAAACCCAT ATCTGATTTA ATTTTCAGCT GATAAATACT CCATATATTA GAATGGCTAC 1140

TTTATCTATT GCATCAATCC TTTAAAACAA AAAACCCATG ATTTCGAAAT TCCCGTATGA 1200

TGGGGTTCCT ACTCTCATGG ATCAGTTAAA TAAATATTAT CACTATCAGT TTATTATTTC 1260

AATATTATTA ACAATATATG TAGTCGTAAA AGGAAAGAGG ACATGAGAAC TTCGGTGTTG 1320

ATTGGCATTA CATAACGCTT CCAAACATAT TATTTGGTAA CAATAAGAAA CTATTTACAC 1380

AATATATTTT GTATAGTAAA ATTATTTTAT AATATTTAAA TCCAATTGCA CAAGGAGTGA 1440

TTATCaTGGT ACCAGAAGAA AAAGGTTCTA TTACTTTGTC AAAAGAAGCA GCTATCATAT 1500

TTGCAATCGC AAAATTCAAA CCATTTAAGA ACAGAATTAA AAATAACCCA CAAAAAACAA 1560

ATCCATTTCT TAAATTACAT GAAAACaAAA AATCTTAATC ACTTTtATTT ATAGcATTTC 1620

TAATCTCAGA AATGCTATA 1639

(2) INFORMATION FOR SEQ ID NO: 351:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1816 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351:

AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 60

CCAAGCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 120

TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 180

AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 240
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TTCTyTATTT GTAaTAACGC TGCAAACATA TCCATAACCC CGCTTCTTAG AGCCCCTTTT 360

AATTnATnAA TTAGGGGCTC TTATGCAGTT GGTGCATTAG CAACCAACTG TATTCCTTTG 420

TCCCCTTTTA ATTTATTaAT TAGGGGCTCT TTTGCTGTTG GTGCATTAGC AACCAACTAC 480

GTTCAATTTA ACCGAATAGT TTAAAATTAA ATACAAACCT TAAATTAGTC TAAAACTACG 540

CCTTTGGTTG TTCAACAAAG CTCGCCATGA GATTTACAAA AGAATCAACT TGTGGCAATT 600

GCAACATGCT CGGATCATAA CTCATAAATG TCGAACGAAT CAGCGGTTCA TTATCAATTT 660

CTACTTTTTC AAACTCAAAT TGTTCTTTGC TGATATTTTT CATCATAATT TCTGGCAAGA 720

TTGTAACACC TACACCACTA ATCAACATTT CTTTGCAAGT TGCTACTTGA TCCACTGTAA 780

TAGTTGCATG GTAATCTTGT TCTAAATTAT CGTTATACCA TTCTTTTATT TGATTTATAT 840

AAATCGGATC AGCTTGAAAC TCTATAAATG GTAACTTTGT AACATCATCT CGTCTATTTT 900

TTGGAAAAAT AAAATAATGA TCATCATTAA ATAAATGTGT GTTAGCTAAA TTCATTACCT 960

TATTTCCACG AGTTATCATA ACATGATAAT CTCTATGATT TGCTTTAATT TGTTCAGTTG 1020

AACCAACTTG CACTTGTATT TCAACATTAG GAAATTGGGC ATTATATAGG CTCAAAACTT 1080

CAGGAAGTAA GGTTTGTCCA ATCAAAGAAG AACACCCGAT TGATATTGTT CCATTCACTT 1140

CACCAATATG TGCCTGCATT TTGTCAAAAA ATAATCGCTC TCTTTTCAAC ATGTCACGAG 1200

CATGCTCAAT AATCATTGTT CCTTCAGTTG TTGTAATCAA TTGTTTTTTT GTTCTGATAA 1260

AAATATCTAC TCCAAAAGCA TTTTCAATAG CTTTTAGTCT TTGTGTAACA GCAGGTTGAG 1320

ATATATATAA AATTTCAGCC GCTTTACGTA ACGTTTTCGT TTCGTCTAAT GTTATTAGTA 1380

AACGATAGTC TTCAATCTTC ATAATTTCCC CCCATAAATT ATTCAATTAT TGAACTTTCA 1440

TGGCTACAAG CATTCATGAG TTCATTACTA ACGAATAATT TCACCAATTT TATTGGTATG 1500

GCTGCAGCTT GAATTACTTA GTTTTTCTTT TGTTGTTGGT GA'ITTTTAGT TTGATTATAT 1560

TGCTTAGGCT TTATTTGTTT GCTTTTTTCA ATATTAGTTT TATTTTGTGG CTTTTGATGA 1620

TTTTTTTGAG CCTTTGCATT AATTTTATTA AAGCAGTACA TGATTTTCTT TTGGAATCCT 1680

TTAAAATCAT TTTCTAACTC TGCCATAATT TGATGTGCAA TCATATATGC TTCATGAAAT 1740

TGCTTTTTTG TAATTTGCTC ACTTTCTAAT GCAAACATTA AATCATCTTC ATCTACCAAC 1800

TCATaTcACC ACTTGG 1816

(2) INFORMATION FOR SEQ ID NO: 352: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 



GCGTTGTCGT CGATGATTAA TAAAGTATGG GTATACCATT AAGAATAACG CTACCCAAAT 60

gAkTGCTAGT GACGTGCCGC CAATGACATC TGAAAAGTAA TGTGCATGAA AATAAAGGCG 120

ACAAAATAAT ATGCTAAGCC ATAATATTCC CATAACCAAC GCACTCAACA CTTTTGTTAT 180

TGTCTTAGCA GCAAGTGAAA TAATAATGAT CATTAAGGCG AAATATAATA ATGTGCTGGC 240

GTTGGAATGT CCACTCGGAA ATGAAAAGCC TGTATCAACG GCTAAATGAT TATATGGTCT 300

TGGACGTAAT ACAGTATCTT TAATTAATTT GTTCATGATG ACACCTGAAA CCAAATATGT 360

CACAAACCAA ACCGCTAAAT GCCTCTGTTT AATAAACAGT ATGATTGTGA CGATAATGGA 420

AATCAACACG ACACCTTTGA CATCTCCAAT TTCCGCACTA AACGTCATAT AGTAATTAAA 480

CAAATTGTTA ACATACTGAC GTTGTGGCTC ACCGAAATAA TCTGTAAACC ATGTTAATGA 540

TCCCATATCT ATATTTTTAA GCCATTCTTG ATTTGTCACT ACACTGTAAA ACATACCTAT 600

AAATACAATC AGCGCGATTA AAAATAAAGG CACTGTcATT TTCGGTGATG TTAATTTTTT 660

ATCTATCATC TTACAATCTC CTCGTATCAT CATTTTCATT TTACAAATGT TATCCATAAT 720

ATCAATGTGC CACAAATTTC ACTTTACCGA CAATATCAAA ATTATAAAGT TCATATTGTT 780

ATGTATATTG CAAATAAAAC ATTGTATAAT TGAAATAACA ATATTTTGCT ATTTTCAATT 840

TAGTACGATT TATATTTATT ATACAGAGGG GGTAAGGCGT ATCAATAGAG TTATTTTTGT 900

CTATATAGCG TTAATCATTA CATTAGTTAG ATTCTTTACC CCTATTCATC CATCATTTTC 960

AAATTTGATT TACTGGATAT TTGTATTATA TTTTATTCCT ATTATACTAT GCGTTATCGG 1020

TTTCAAGGCC GAAAAACTTA TTGCAACAAT GGTCATTATA CCTAATTTTT TAGGAATACT 1080

TTATCGATTA TATGCCTACG TCACACATAT ACTCTTTATG TAAAAGGATA GTGGGCATGT 1140

CTCGCAACAA ATATGCTTGC GTCGACTTGT CACCATTCGC AAATTTTATG ACATACGCCT 1200

TTCACGGGCT ATATTCAGAC CCACGCATTC ATCCACGTAA TAAACACATC ATGTAATAGA 1260

AAAACAGCAC ACCCAAATAT ATGGCGTTGC GCTGTTTAAC CAAGCATACT TCTATAGCTT 1320

TAATAAGCCA GCAGAAGCAT ACCTAACCTT CTTAAATATG CTTTTCCAAA TTATCCTCAA 1380

GTTTGAATAC GATAATACGT TCACCTGTAA CTGTACTTAA ATCACTATGG AAGCTCATCA 1440

CTTTGATACC TGTAATTTTA AAAATGATAT CATTCAAATC TTGCTCACCG GATTCAACTA 1500

ATTCAGAACG TGTTCGTTTA ATATTTAATA ATCCTTCATT CGTACTACAT ACACGATATT 1560

CAGCTGGCGT TAAGATACCT TGTAAACTAA TAATCACCAT ATCTCTTAAA ATGTCTGATT 1620
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TTTCAGCTTC GATTTCACCT TTCGTTCTTT TCATATCACT AACTCCAATA ATATTAAAAT 1740

TGATTACTTC ATCTTTGTAT CGTTATCACG ACATCAACTA TTACATTAAG TTTATCATTT 1800

TTAGTATATT TTAAGAAGCT AGAACATTGT AGATATGATG ATATATTAGT TACTTAGCAT 1860

CGCAACATAT CATCGTTAAA TCCAACTTTT AAAACGCCCT TCCTCATTAA CGCTCATTAA 1920

ACGCAGCCAA TGATTAGACA CCTTCCTAGC GAAATGCTCA TTATTCGCGA GTAGTCTTGC 1980

TACAACATAG TCGGGTGCCT GAATAACGAC AAGTAAACGA ATTGGCGAAT GATACATCGT 2040

CCGATCAGCA GCCATAACAG ATTGCCATGA TAAGCCATAC ATCAGATCAC TCGCATTACC 2100

TTGCATGACA CCAACACCTG ACGTGACGGT TTGTGTCGCT TTATTCCCAC TTCCGTAAAA 2160

ATGCGGCGCA ACTGTCGACG CATAATATTG TAAATTAATC CATTGTGCCA CAAGTGCCGG 2220

ACCAGAAATG ATGGTATTTA ATAATGTGCC ATCTTTATCT TTACGCCAAT CATAATTGTG 2280

TAAAAATGTC CGCCCTTCTA AATCAATGCC TTTTGTTAAT TGGCGTCGTC CAATTATAAA 2340

TGATGCATTT TTAGCCAATC CCCATTCTGG ACGTACCTCA CTCCAATCAC TCGCAAACCG 2400

CTGCGCTTCT TCCACTGGAT GATTCACACG ACCAATCGTT GGCAGTTTGT CCAAACGTTC 2460

GCGATTCGcg TGtTCAGAAA TCATCGGCAT CGCGTCATTC AATGATTCAT ATGCATCTAA 2520

AGCAATAGAA GATAATGTGT CTGGCACATA TACCCATGCC AACGTATCAG TAGACGTATG 2580

ATGTTCTGCT ACCGCAAAAA CAGTTGTCTC TGGAATATAC ACACCTGATT GTTTTAATCC 2640

TTGTCTGACA TTTGGACGAT TACATATCAT CGCTAATAAC TTAGCATTAA AACCGCTTGA 2700

TGCGCCACCA CAAGCCCCAC ATTCAAGTGA TGCATGATGT GGATTATTGT GAGAATGACT 2760

AGCATGACCT GCTAACACAA CGAACGGCGC AAATGCTTCG GTTAAATCCA TCAATTTCAA 2820

CGCTTGTAAC GCGAAATCAA TTTGCTCTTG CTCAGTAAAT CCAACAGGTA AGTCTGATGT 2880

TCGGTCAAAC TCACGATCAA TCGTCAACTT TGTTTCAGGC TTTTTCAACC ACTTTTGTTT 2940

TATTTTTTGT AAAGACGCGC GACTTTTTCT AGGCATAATC GAATTGACAA TGGTACTTAA 3000

GCTTAAAAAT GGCCCACTTA ATTCAGGCAA TAACAGACTA GGCATGACAT TATTTTTCAT 3060

CAATTTAAAT GTGTAAAACA TCGATGACAT TGTCTGTTGC TGTTGTCGAT AAACATTCAT 3120

TrTHfAAATT CTTTAATGCG ATATGCCGGC GGTACCATGA CAGGTAATGA 3180

ATCATGTTTG AATTGTTCGT CTACGGCATC TTTTTGAATA GGTAATCCAA AGAAGCCTGC 3240

AATACCAATC GTTTCAAAGG GCCCTGCTGC TTCGATATGT CTACGAAATG GTTCTGAACG 3300

AACATCTATA CAAAATGCAA TTTGCGCTTT CGTTGATGTG CCCATCTGAT TTAGCTCGCT 3360

ATTATTTTCA TCAACTGCTT GTGTGTCATT TAACAATACT GAATGTGGCT GATTAGCGTT 3420
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TGCTTTAATT TTTTGTTTTA ACTGAGATTC GTATGTCATT TCCCAGGCAA TTAGCCATAA 3 540 

ATTTTTAAAT ACATTTTTAT TCATAGTTGC TGCAAAATGA ATAAACGTTT GAATTTCATT 3600

GACGTCATGT TGTAGTAATA CATCGCTAGG CATATCACTG TAGTAACACC ATGATGCAAC 3660

AGTTTGCTTA AACCAATTTT CCGATCTACT TTCACAATCT TTAGCGACTG ACTTAAACTC 3720

ATCACCAACT AGCAATTGTT CGACAACTAA CCGAATTGCC AAATAATCCG TTAACAAATG 3780

TTGTTCAAAG TGATGCTGTT GTGAACGGTA ATACAACATA CCTGCCCAAC CCGGTAACGC 3840

CAAAAGATGT CCTTCAACAT AAGCTTGGTA GTCTTCCTGA TCTATTGAAA AATGAGTTAA 3900

TACTGACTCT ATCGTCATTT CAGGATCATT GGGTAAGCCT TTAATCACTT GGCGCTGTGC 3960

TTTAGTAAAA CTATGGTCAT GTTGCGCTAA ATGCAACCAT GCATGGTAAA AACTTTGcTC 4020

ACGCTTCGGC ATTGTCCAAC TCGATAGAAA TTGATCGATA TAAAGTTTCG TCCATTTAAT 4080

CATTTGACGA TTCACTTGTT CGCTAAGTGG CTCACCTTGT TCATCTATTA TTGCATCACT 4140

CATCGGACGT ACATCATAGT GATGATATGA TTCAGCCATA TCACGTTTTG ATTTTTCTAA 4200

TAGTAGATCA GCAACAACAT CAACATTTGA ATGATTCATA TATGATGCAG GTACGTCTTT 4260

TAATGTTTTA ATGTTATCAA TATAAAGATT GATGTAGTGT TGCGGGATAT TGTAGTGATG 4320

TTCAAGTAAC ATATCAGTAA CAAGTTGATT AAAGACACTT TCATCTAATT CACCACGTGC 4380

CACAGCGCTT TCTATTAATG CTTTATTTGG GAAAATATCC ACATCTCGAA CATCACGTAA 4440

CCATTTTGCG ACATCTTCAA ACGTATCCGC TTCTAATCCT TCCCATGGAT TTCGTGCTGC 4500

AAAAATCGAA ATTGGTGATA ATGGTGTAAT AACACGTTTC GCATTTTCAA TGACTGAATT 4560

GATATTTAAC TGTGTTGTCA TACCTTTCAC CTCCTATAAA TACTTCTTCA AATAATTCGG 4620

ATGACTTTCT ATCGCTTTCG AGCGTGCTTC ACCTAGATTA ACTAACCACA CGTACAATAC 4680

CGCAAAAGCC TTAGAGTATC GATGCCGCGC CACCCAAATA CTTAATAAAC TGCCAAAGAT 4740

TAAAATAACA ACACTAATGA TGACACTCAC TGTAGGCGGC GTTGTCGCAT GTGTTGTTAT 4800

ATTTTGTAAT ACAGCGTAAA AATAATTATG TGTGATGACG TAGATAAATG TCACGATTGC 4860

AATCAAAATC ATACCAACAA GACGTGCCAT GCGTCCTTTA CTAAAGGCTA CCATTTGATT 4920

CCAAGATACA AGTAATGACC ATCCTAGAAT GAGTGCACTT AACACTTCAT ATGCACTTCT 4980

GTCACTACTC ATCCAAAATA GAAATGCCAC GATAATAGCT AATACACGTC CCATGACAAT 5040

CCAGCCATAA GCGTCTTTAG CAGATGCTTG TTTTGGAATA TTGAATCGCT TCACGATAGA 5100

ACCTGATTGT AAAAATAATG TTGCTTTAAA AATACCGTGC AATATTAAAT GAATAATCGC 5160

TGCTGAATAT ACACCCAATG CACATTGAAC TAACATAAAG CCCATTTGAC TCATCGTAGA 5220
TGCTGAATAT ACACCCAATG CACATTGAAC TAACATAAAG CCCATTTGAC TCATCGTAGA 52 20 
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AGAAATACTA GAAAGGATAA GTAATAATGA TAACGCAAAT CCATTATCAA ATATCGGCGC 534 0 

AAAACGAGTT AGAATAACAC CACCTGCATT CACAATTCCT GCATGCATAA TTGCCGATAC 5400

TGGCGTTGGT GCCGTTACAG ATTCAATCAA CCATCGATGA AAAGGAAATT GTGCTGCCGG 5460

TATCATGACA GCTAATACAA GTAGTACATT CGTCAACAAT GACCATGTCG GATGAACTAT 5520

ATGTTGTGGT ACCCGCCACT CGCCAGTCGC AATATAAATA GTTACAATTG CTCCAACGAA 5580

TGCAAGCCAA CCACATAAAA ATGTCATGCT TGATAATTTC GCAGACTCAC GTGGCACTTT 5640

CCAAAAACGA TTAACGTTCA TCAGCAATGT TAAACATAAT AATGTAATAC CCCAGCAGAG 5700

TGCCATCAGT CTTAAGTCTT CAGACATCCA TGCTAAAGAT GCAAACGACG TAATCGCAGT 5760

GAACAATGGA AAGTAATGTC TATAATGATG ATCACCTAGT AAATATCGCA TTGAAAACTT 5820

TTGAATAATA AAGCCAAGCG CCATTACAAA GCCAGCTAAT AACCAAGATA AACGATCTAT 5880

TTTAAATGGA CCTAAGACAT GTTGACCATG AATACCGAAA AAGCCAATGA CTGCAAATAA 5940

TACTGGCATG ACTAGTATGT ATAAATGTAA TTTAATATAT CTCATTGGCA TAACTGGTGC 6000

TAAAAACAAC AAGCCACTTA TCAATGCAAT GATAAGCGCA ATAACAAACA GTGAAAATAG 6060

CAATTGAAAA CTTAACACTG CATAACCTCC TTATTTCTAA TCTCTCGCAT AATTGCTTAT 6120

GTATAAAAAT AAAAACCTAC AATAGTAGAT TCTGTACATA ATGGCAGAAA ATTTACTATT 6180

GCAGGTTTCA GTTTAACTAG ACACTGCATC ACGGTACGTT GATATACCTT GTTGCAGTGT 6240

TCTCTTTAAG CGTGCTCCCA TGCACATATG TATATAAAAT GTTACTTCTG TCTGTTCAAT 6300

TCATCTTCAT AAATATGCTT TGCCTAGACG AGACCTAACG TGTTATTCGT TTTAAACTTA 6360

TAACATAAAA TATAATTAAA TTTCTGCTTC ATGTCAAATT CATGAGCTTA ACCTCTATTA 6420

AACCAATGAT TGTAAAGATT TTGTAAATGC ACCTGTACAG TTAGGCAGTA TTTCCCGTCC 6480

TTTTAAAATA AAAAATTCGC AGTTATGATC ATAACAATTC AAGTTAGGAA AAAAATCAAT 6540

TACGCACAAG ATAACTATGT ACAATGAAGT TAACTCATAA GCAAAGGAGG TAATCTTAAT 6600

GGGTATCATC GCTGGCATCA TTAAAGTTAT CAAAAGCTTA ATCGAACAAT TCACTGGTAA 6660

ATAAGATTTC ATAACAAACA AAGGAGGTCT TTCACATGGG TATCATTGCA GGAATCATTA 6720

AATTCATTAA AGGATTAATT GAGAAATTCA CTGGTAAGTA AGTTATAAAA ATCTCATAGA 6780

TATGAACATC TTATTTGAAG GGGGCCATTC ACATGGAATT CGTAGCAAAA TTATTCAAAT 6840

TCTTTAAAGA TTTACTTGGT AAATTTTTAG GTAACAACTA ATCTCAAACA TTAACGATCA 6900

ACAACTCATC ACTATGTTAA ATCAACATAC AGGAGGACAA AACGATGGCT ATTGTAGGTA 6960

CTATCATTAA AATCATCAAA GCAATTATCG ACATTTTCGC AAAATAATTT AAGCGAATTG 7020




EP0 786 519 A2 



TTATTGATGT GAGGTGAGTC TTGTTAGTTT GTTGCAAATA AATGGTCTTG GTGTTTTTTG 7140

TATAGGACGT TCTTAGTGGG ACATACGGAA TATTCGTGAT CTTTGTAGTC TGACGCGTTA 7200

TATTTTTGTG GCGTGTTTTA TGTTTGATAC TCGAGTTCTG AGACATTCAT GATTTGGCAT 7260

GCGAAATCTT AATGATTTTC ATGATCTAGC GCAAGATATA TTGGCCACGT GCGGAATTGC 7320

GTTGCACGTT TAGACTGAAA CACTCGTGTG ACCGTAAGTG TTAATAGTAC ATTGATAGCT 7380

GCATTTACTT CACTCATTTT TATGACTGTT AAACAATGAT TGTACCTTCA ATTAACAGTT 7440

GGTACGATGG TTTTGCCATT TTTCATCAAC GTAAATATAA AAAGGACTAA GACACATACA 7500

TGTCCTAGCC CTATGGATAA AATGCAAATT TCTGCTTTAT CAAAACTATC ACACTTTAGA 7560

TAGATTGAAA ACAAAAAGAT CCTAAGAACA CCTTAACTTT TTATTAATTG TCATAAATTG 7620

CAAACAATTA AGCCACAATT CAAAAATGAT TATACTTCAT TCAACTTATC GTGCTGGTCT 7680

AATTTGCCAT TGATATGGAT CTTCAAATTG TTGCCAATCT GCATCAATTT CTTGCGCATT 7740

GACTAAGCAT GCGTCGAGTT CTTTTGTTAA TTTTTCTTCA TCTAATTCTG TACCAATAAT 7800

GACAAATTGT GTATGACGAT CGCCATATTC TGGATCCCAT TCAGCTGCGA CATCTTGACG 7860

TTCTGCTAAT ATTTGTGTTT GTTGCGCTTC AGACATACTA GCCACCCAAT ATGTAACTGG 7920

ATGAATATTG CAAGATGACC CTGCTTGAGA TAATAAACAT GCTACGTGAT TGTATTGTGC 7980

TAGCCATACG ATACCTTTTG ATCGAACGAC ATTATTTGGC ATGCTTTCTA ACCAATCATT 8040

GAACCTTTTA GCATGGAAAG GTAGACGACG TTTATATACA AACGATGATA TACCATATTC 8100

TTCTGTTTCA GGTGTATGCG ATGCATGCCC ACCAGACTCA AGTTCTTTGA TCCATCCTGC 8160

TGACTCGCTC GCTTTTTCAA AATCAAAACG CTGCGTATTC AAGACTTCTT TTAAATCTAC 8220

TTCAGAATTT GTTGTCTTAA TAATTTTAGC AGTCGGTTGC AATGCGCTTA ACATTTTTTC 8280

TAACTTCGCT AGTTCTTCTT CACTAATTAA ATCAATTTTA TTAATAATCA ATACATCACA 8340

AAATTCAACT TGGTCAATTA ATAAATCAGC AATCGAACGC TCATCTGTTT CGTCAACGCT 8400

TTGATCACGA TCCATCAATA AATCTTCTGA GTTGATGTCA TGTACGAAGC GGTTAGCATC 8460

CACAACTGTA ACCATTGTAT CTAAACGGCA AATCGCTGTA AGATCAATGC CAAGTTCATC 8520


ATCAATATA r 
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TTGACTCAAT 


8580 


AACAATTTGA TCGATGCCAC CTTTTTTCAC TAAACGCTCA ACTTCTTTTA ATAAATCGTC 8640

TCTAAGTGTA CAACAGATAC AACCATTAGA AAGTTCGACT AATTTTTCAT CTGTACGCGA 8700

TAGTCCCCCA CCATCTGCGA CAAGATCTTT ATCGATATTT ACTTCACTCA TATCATTTAC 8760

AATTACCGCG ATACGTCGAC CTTCTCGATT TTGTAAAATA TGATTTAACA ACGTTGTCTT 8820
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ACTTCAATTT ATTTGTAAAT AGGAATAATT CTGTTTTACA TTATATAGGA GCGTTTCCTC 8 94 0 

7TTCGCAATC TTCGATAATA AAAAAATAGT ATACTTAATT AAATTATTGA GCGCTTTACT 9000

TTATAATGGA GACAAAGATA TATCTCACGA AAGAGAATCG AGGTGTATAA ACATGTTATT 9060

TGTCATTTTA GTTTTATATG TTACTGGTAT TGCATTTATT CTACTCAGTG TTTTTGGTTC 9120

AAAGACTGAA GGATTATCTA CGAAACATAC TTTATATACC ATTGGCAGTG CTATTATAAC 9180

GATTGCTATT TTCATTTCAA TTGGCTATGC CATTCAATAC TTAACTGCAG CGCTTTATGG 9240

TTTGTAAGGT GAAGGTGATG AGTAACGGGT AGTTCGGGAG AGGTTAACTT GCGTTGATTT 9300

TGATAAAGTG ATCATAGCTT TTAGTACTTG AGGATTTTTA TTGTTGCTGT TACGAATGTG 9360

GTCATGTTTA ATGCGGGACA GTAATTTAAG TTGTTTTTTT ACAATTGAGA GTGTGATATT 9420

TCGATTCGGT TCGAATTACT TTACATGGGA ATAATATAAA TTAAAAAGAA GCGGCCTAGT 9480

GTCAGTTGTG AATATACTGA ACATTGGTCG CTTTATTTAG TAGTATGATA TGTAGTTTAG 9540

CTATTAATTT TTTTCAGGTC ATCCTTAATG CTGTCTATCT CAGACATGGC ACTTTTAACC 9600

CAATCTCCTT GAGCTGCACC TTTAAAATTA GCTTTAAAAg ctTCGCAATG TTGCGCCATT 9660

TGTTCAATTA ATACTTTTTC TTCACCTTTT AATCCGTTTT CAATATCTTT GTATTTATGC 9720

TTATGTTCAG GTGCAATAAC TGTGCGAATA TTTTCTTTTT GCGCTTCCAT TTTAGATATG 9780

AGATTAAGTG TTTCTACTGT AGTACTTATA TCTGGCATTC TTAAGGTCAT ATCTGGTTCT 9840

ATTAGAGTCA TTTAATCTCC TCCAAATTAT CAGTCACTTA GCTTATCTAA CTGCTTTTCA 9900

TAAGACTTTT TTAAGTCTTC TTTATATTCT TCTAATTTCC CATTCTTGCT TTCTGA 9956

(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2411 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

TTTCTTTTAA CAGaTTTATC cCATTAATTG TTCTACAAGC CCACGaTGAG CAATATCATT 60

TTTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA TACCTGGACG 120

AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAACCAT TAACTAATGT 180

ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGTTTAT ATACAACTGC 240

AACATCGTCA TCTTCATAAT AAATATCTAA ATTTAAATTT TCAGGTAGAA TATCAGCTTC 300
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AACAACTTTA TCGTTTGCAA CGACTAAACC TGCTTTAATC CAATCTTGTA TCTGGTTACG 420

AGACCAATCA TTATTTAATT CAGGCAGCAA CTTATCTACA CGCATACCTG TTTGTTCTTT 480

ATCTGTAATG TTAAATTCAT AAGTCTCCAT TACTTAACCT CCTTCTCCTT TTTATTGGAA 540

GTATCCTTTA ATAAGGCAAT AATAATTAAT ATTACACCAA TTGTTAAACT TGAATCTGCG 600

ATATTAAATA TTGGAAAATC ATAACCAAAA ATATTTGTAT CAATAAAGTC AACAACTTCT 660

CCTGTTAAAA TTCTATCAAT AAAGTTTCCA AGTGCACCTG CAAAAAGTAA ACTAATAGCA 720

ACTTGCATAA ACAAATTATA TTGAGCATCT TTAATAAAGA AATATACTAA GGCTATTAAT 780

ATAATAATGG TAATAATAAA GAAAAATGTC ATTTTTCCAC TCAATATTCC CCATGcAGCA 840

CCATTATTTC GATGTGaTGT TATGTTTAAA AAGTGCGGTA TCACTTCAAA TGAATCTCCA 900

ATTTTCATTG TAGTAGCTAT AATATATTTA GTAACTTGGT CAAATATAAC GACAAATACT 960

GCTATTAAAA TGGAAGTGCC AATAAAATAT TTTTTGTGCA TTTTCGTTCC TCCAATCAAT 1020

CGTCCATGAG ACAACTCTTT ATATTATAGC TTACACCTGC TAATAAAAAA AGTAAGCATA 1080

TTACATTAAA TCTAATGTTA CTAACTCAAT ACTTGATAAA CTACTATGTT TTGACATTAA 1140

ATATGAACTT AATTATTCAT TTATCATATT TAAGATGACA TTAAAAATTA GGAAAGCAGG 1200

CTGGAACATA AATCCCTAAA AAGACAGTAG TAAGATATTT TCTAATTAAA AATTATCTTA 1260

CTGCTGTTCT CTATTTATAC AATACTTCGT ATTGAATGGC TTCGCTATGC CCATCTGGCA 1320

CATTACTGTA AAATTCTATA AATAGAATTT TTGATGATGG GTCCCTTCCT AGGGTGCCGT 1380

CTCAGCCTCG GtCTTCGACT GGCACTGCTC CCTCAGGAGT CTCGCCATTA ATACTACGTA 1440

TTAACATGTA ATTTTACTTT TAAATACTTT AAAAAAATAA GACATGAATC GTCTACACTT 1500

AATTGGACAA ATTCTATGAG AATAGATATT GTTAATTTAA GAAAGTAGGC TATTTTGAGT 1560

TtCACTCGAA TGTCAGTTCG AGGAATAAAT AAAGTTAAAC GAGAGCTAGG TTTTGTATTA 1620

ATGGCAATTA ATATAAGGAA AATAGCAGCT CAACGAGCTG TACATTATAA AATACATATC 1680

AAAAAAGCTG ATTTCTATCA AATAATTAAT AGAAATCAGC TTTTTTACAT TGCCTAAGAA 1740

CTTAATGTCC CAAGCCCTAA AACTTGTTGT TATTTATTTG ATTTAGCAGC GATACGTTTA 1800

TATCTTAAGT ACATAAATGC TAAAAGTATA AACCAAATCG GAATAAAATA AATTGCACGT 1860

CTTGTATCAA CATTAATAAA TAATAACCCG AACACAAAAA TGAAGAATAC AAATATTAAG 1920

TAGCCCATAT ATTTGCCACC TAATAGTTTG TACGTAGCAT TTTTATGTAG ATCTGGGTTT 1980

TTACGACTAT AATTGATATA TGCAATGATA ATCAGACCCC ATACAACTAA AAATAACACT 2040

GTAGAGATGG TAGTCACATA CGTAAATACT TTTGTCGCAT CTGGGAAAAT ATAGTTTAGT 2100
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TTATTCGTCT TAGAAAAGTT CGGAGGTGCT TGTyGTTGAC TTGATAAACC G aAAAG CAT A 22 2 0 

CGGCTATTTG AGAATATACC ACTGTTACAT GATGAAGCAG CAGCGGTTAA TACTACAAAA 2280

TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTTTnC GAATGGACTG 2340

TTATCAGGAT CAACTTGCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 2400

TTnnATATTA A 2411 
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:

GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACCCAGA 60

AGGTCAAAAT ATTTTTGATG CGCTAAGTCA ATATGAAGTT AAGCGACGCG GCGATATGGA 120

AGAGGATCCA TCATATAAAC AACTCATTTC TTATTGTTTA CTTGAAAATG AGCATGGCGA 180

GATATTAGTG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC 240

AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGGAGCA GAATCTATTA ACGAAGTATT 300

GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT TTAAGTGAGC AAGATTCACA 360

AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGTGGGCA AGGTACATAT 420

TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC 480

TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG 540

GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGTG ACATATATGT CAGATATTAT 600

TCCAG 605

(2) INFORMATION FOR SEQ ID NO: 355:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 668 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:

TTTATTAGCA CATCCAAACT ATTCATATGT TGGACAATTT TTAAACGAAC TAGGATTTAA 60
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ACAATTAGAC ACTGAACATT TAGCTGATTT AAATCCAGAG CGTATGATCA TTATGACAGA 180 

TCATGCTAAA AAAGATTCTG CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 240

GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTGAC CGTGATGTTT GGGCAAGATC 300

TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 360

ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 420

CATAAAAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTTGTGCTT TCTTTTTATT 480

TGTATATATT TAAATATGGC GATTGGTTCT TCGAAAATTA ATTTTAGCGA TATCATTCAC 540

TATGTTACTG GTCATACAGA TACGAAAGCA ACGTTTTTAT TGCATAATGT ACGTATGCCA 600

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 660

GCAATGAC 668

(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

ATACAAAAAA ACATATCGAA AATAAAGCTA AAAGAAACTA TCAAGTTCCA TATTCAATTA 60

ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 120

CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 180

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 240

GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 300

ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 360

TACCATACAC AATTGCAGTA AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 420

CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 480

GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 540

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 600

TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 660

AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 720

ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 780
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(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357:

AAAGTAAAAA TAAATCTCCC TTTTTAACTT TCGTTTCTGC CATAGCCATT GCTTCTTCTG 60

TGATAGTTGC TACAATATCT TTTCTTTCAC GGTTAAAATG TTCAACTTGT TCTGCTAAAA 120

ATGCAGCTTC TTCTTCGACG TCAGTCATCA ACAATTCGCa AGCTAATGAT GCGTCATCTA 180

AACGACCTAC AGCATTAAGT CTAGGTCCAA TAATAAAACC AATTGTTTCT TCATCAATAT 240

TGTCATTGTA TCCCGCTTCT TTTAGCAATG CTTTAACAGA GGTCGGACAT TGATCATTTA 300

AGACTTTTAA TCCTTGTtTC ACTAATGATC GATTTTCATC AGTTAAGGAT ACTAAATCCG 360

CAATGGTACC TATCGCAACT AATGCTTTAA AATAATCAGG TACATTTtCA ATCAATGCTT 420

GTGCTAATTT GTATGCAACA CCTGCACCAC ACAATTGTTG GAACGGATAA TTAAACGATG 480

GATGCATTGG ATGTACGATT GCATATGCTT CTGGTAATGT ACTACCAATT TCAT 534

(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3621 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

GGTGAGTCAA ATTAAATGAA TCTAATAAGT CATAACTATC TATTTGTAAT GTGCAACGCT 60

TAACGCATAT ACAAAATGAA TGTGCTGATA ATGATTTACT CAAATTAAAA GGTGATTTTT 120

ATTCAATGAT GAATGAAAGT TGCCTTTTTA TTTTTGGTAA AAGTTAATGC GTCAGTGAAT 180

TGTGTAAGTT TTTCAAAAAG TAAAAAGAAA TAATAAAGGT GAATTATTAG AATTCCArAA 240

ATAATTCATT ACATTCATAA AGCATTTTAC AAATGGTAAG AAAATGAGTG TTACAAATCT 300

AAATATTGCA AAAGAAGCTG ATTTAGTCAC AAAAAATGTC CTATGTAATA ATTCGAGAAA 360

GATGCACTAT ATACGGTCTT CTTACTATTC AAATGTAAAA GTTGCTTATT TGCGTGGCTT 420

TTTGTTTTAT AAAAGTATAA AATTTTACTA TAATATATCT TGTAGAGAAC AATGAAATGA 480
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ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTTAT CGATATGACG 600

ACTTGAATAT GATAAAGTGA CATATTTATG TATATGACTA TTTCGCAAAA TGTAATCGAG 660

GTAGAATTTC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 720

AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 780

ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 840

GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 900

CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 960

ATGAAGAAAA TGAAACTGGA GCTTACATCT TATCTAGAAG ACAACTTGAA ACTGAGAAGT 1020

CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATGAAAT CATCGAAGCG AAAGTAACAG 1080

AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 1140

TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTGA TGGACAAACA ATTCGTATTA 1200

AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 1260

AAGAAGAAAA CGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCGATGTTA 1320

TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCGGTGTTG 1380

ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 1440

TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 1500

TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 1560

ACGAAAATGA TGTCATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTG 1620

AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTGCACAC AAACACATTG 1680

GTACGCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA TGTTAAAATA TTAGGTATTG 1740

ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 1800

TTGAAAGTGA TCCTTCTACG ACTAAGGCGT ACTTAGAAAA CGAAGAAGAA GATAATCCAA 1860

CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 1920

AGTCAACTCC ACATGTTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 1980

GAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGTTT TAAGTGCtGa 2040

tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTATTAAA 2100

AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAGCT ATTGTAGGTA 2160

GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 2220

TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 2280
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5 



10 



15 



20 



30 



35 



AAATTAGAGC GCAGGCAGAA ATCGCCATAG ATGAAGCGGA TGTTATTATT TTTATGGTTA 2400

ACGTGCGTGA AGGATTGACA CAAAGCGATG AAATGGTCGC TCAAATTTTA TACAAATCTA 2460

AAAAACCGGT CGTATTAGCG GTTAACAAAG TAGATAATAT GGAAATGCGT ACAGACGTGT 2520

ATGATTTCTA TTCATTAGGA TTTGGTGAAC CGTATCCGAT ATCAGGGTCA CATGGTTTAG 2580

GTCTTGGTGA CTTGTTAGAT GCAGTTGTTT CTCATTTTGG TGAAGAGGAA GAAGATCCTT 2640

ATGATGAAGA TACAATTCGA CTATCCATTA TTGGACGACC AAACGTAGGT AAATCAAGTT 2700

TAGTAAATGC TATTTTAGGT GAAGATCGCG TTATCGTTTC TAATGTTGCA GGGACAACGA 2760

GAGACGCTAT TGATACAGAG TATAGTTATG ATGGACAAGA TTATGTTTTA ATCGATACTG 2820

CTGGTATGCG TAAAAAAGGA AAAGTATATG AATCAACTGA GAAATATTCA GTATTAAGAG 2880

CTTTAAAAGC GATTGAACGT TCAAATGTTG TTTTAGTGGT CATAGATGCA GAACAAGGCA 2940

TCATTGAACA AGATAAACGT GTTGCAGGAT ATGCACATGA ACAAGGTAAA GCAGTCGTGA 3000

TTGTCGTAAA TAAATGGGAT ACTGTGGAAA AAGATAGTAA AACGATGAAG AAATTTGAAG 3060

ATGAAGTACG TAAAGAATTC CaATTTTTAG ATTATGCACA AATTGCTTTT GTGTCTGCTA 3120

AAGAACGCAC AAGATTACGT ACATTATTCC CTTACATCAA TGAAGCAAGT GAAAACCATA 3180

AAAAACGTGT TCAAAGTTCA ACTTTAAATG AAGTTGTTAC TGATGCAATT TCCATGAACC 3240

CTACACCAAC AGACAAAGGT AGACGTTTGA ATGTCTTTTA TGCAACACAA GTTGCTATAG 3300

AACCACCGAC ATTTGTTGTA TTTGTTAATG ATGTAGAATT AATGcATTTT TCTTATAAAC 3360


GCTATTTAGA \j AA I LAAA 1 L CCTflCCClCW TTGGTTTTGA AGkTACACCA ATTCATATTA 3420


TAGCTCGAAA GAGAAATTAA CGATTGGGGG ATAACAATGA CTAAAATTAC CGTTTTTGGT 3480

ATGGGAAGTT TTGGGACAGC CCTTGCCAAT GTTCTTGCAG AAAATGGACA TGATGTTTTG 3540

ATGTGGGGTA AAAATCAAGA TGCTGTTGAT GAATTAAATA CATGTCATAC AAATAAAAAG 3600

TATTTAAAAT ACGCGAAATT A 3621



(2) INFORMATION FOR SEQ ID NO: 359:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 
CTTTCGGAAA TTAGGATCnG nGCTATCTTG GCCCCAATTA CCAAGGGAAC TAnTGGCACC 60

AATGCtTCTT TCATCTCCAT GCCCTGTTGC TCATTATTAA TAACACGGTC TATTAACACA 180

ATGGCATTTG TTACTACGAT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTGAT 240

ATTGTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 300

AATAAAATTG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATACG 360

ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTGCATTGTT AATATCATCT 420

GATGCACCAC CGATATTAAC CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAGAC 480

ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 540

GCATAATCTC CTTGTTCTTG CGTCAATTTA CTTGGTGTCG TTGTTTTAAC TAACGTAGCG 600

ATATCTCCCA ATTTAATCGT ACCACCAGTC GGCTTTTTCA AAG 643

(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 60

TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TTTGCCGAGG 120

GAGATGTCAA AATGATTAAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 180

AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 240

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 300

CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 360

TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTTCATATAT TGAACATAAA ATGCAAGGTA 420

ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 480

AAAAAGGACG CTATCGTCAA TTTAATCAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 540

CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 600

AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 660

AAGCGTTAGT GAaACACTTT GAACCAGTAA TTCATGAATT TTGTTCAGAT TGTCAATCAC 720

GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACCGTGAT AAAGAAGCGA 780

TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTATGAAC 840
GTGGATTGGA TTATTATACA CATACAGCAT TTGAATTAAT GATGGATAAC CCTAACTATG 960

ATGGTGcCAT TACAACGCTT TGTGGTGGTG GCCGTTATAA TGGTTTATTA GAATTGCTAG 1020

ATGGTCCAAG TGAAACAGGT ATTGGTTTTG CGCTAAGTAT AGAACGATTA TTGCTTGCAC 1080

TTGAAGAAGA AGGTATCGAA TTAGATATTG AAGAAAACTT AGATTTATTC ATTGTTACAA 1140

TGGGTGATCA AGCAGATCGA TATGCTGTGA AGCTATTAAA TCATTTGAGA CATAATGGTA 1200

TTAAAGCAGA TAAAGACTAT TTACAGCGTA AAATTAAAGG ACAAATGAAA CAAGCAGACC 1260

GTTTAGGTGC CAAGTTTACA ATCGTTATTG GTGATCAAGA ATTAGAAAAT AATAAAATCG 1320

ATGTTAAAAA TATGACAACT GGTGAATCTG AAACAATTGA ATTAGACGCA TTAGTCGAAT 1380

ATTTTAAGAA GTAGAGAGGG CGTTAAAATA TGAGTAAGAG AACAACTTAT TGTGGATTAG 1440

TTACTGAGGC ATTTTTAGGA CAAGAAATTA CATTAAAAGG ATGGGTTAAC AATCGTCGTG 1500

ACCTTGGTGG ATtGATTTtC GTTGATTtAA GAGATAGAGA AGGAATTGTA CmAGTCGTGT 1560

TTAATCCTGC ATTTTCAGAA GAGGCaTTGA AAATTGCTGA AACAGTACGT TCTGAATATG 1620

TTGTAGAAGT TCAAGGTACA GTTACGAAGC GTGACCcTGA AACAGTTAAT CCTAAAATTA 1680

AAACTGGCCA AGTTGAAGTA CAAGTTACAA ATATTAAAGT GATTAATAAA TCTGAGACAC 1740

CACCATTTTC TATAAATGAA GAAAATGTTA ACGTTGATGA AAATATTCGA TTAAAATACC 1800

GTTATTTAGA TTTACGTCGT CAAGAGTTAG CGCAAACATT TAAAATGAGA CATCAAATTA 1860

CACGTTCTAT TCGTCAATAT TTGGATGATG AAGGGTTCTT TGACATCGAA ACACCAGTAC 1920

TAACGAAGTC AACACCTGAG GGTGCACGTG ACTATTTAGT ACCATCTCGT GTTCATGATG 1980

GTGAATTTTA TGCATTACCA CAATCACCAC AATTATTTAA GCAATTATTG ATGATTAGTG 2040

GATTTGACAA ATACTACCAA ATCGTAAAAT GCTTCCGTGA CGAAGATTTA CGTGCAGATC 2100

GTCAACCTGA ATTTACACAA GTCGATATTG AAATGAGTTT TGTAGACCAA GAAGATGTGA 2160

TGCAAATGGG TGAAGAAATG CTTAAAAAAG TTGTTAAAGA AGTTAAAGGC GTTGAAATTA 2220

ATGGCGCTTT CCCACGCATG ACATATAAAG AAGCGATGCG TCGCTATGGT TCTGATAAAC 2280

CAGATACACG TTTTGAAATG GAATTAATTG ACGTTTCTCA ATTAGGACGT GATATGGACT 2340

TTAAAGTATT TAAAGATACT GTTGAAAATG ATGGTGAAAT TAAAGCAATT GTCGCTAAAG 2400

GTGCAGCTGa ACAATATACT CGTAAAGaTA tGGGaTGCTT TAACAGAATT TGTaAACaTC 2460

ymTGGtGCTA AgGtTAGCGT GGGGTTAAAG TTGTGGGAAG GTGGTTTTGA CAAGGTCCCA 2520

ATGG 2524
(2) INFORMATION FOR SEQ ID NO: 361:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1507 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

TCGTTGAGTA AAAGTCCAGA AAATTGGATG AGTAAACTTG ATGATGGAAA ACATTTAACT 60

GAGATTAATA TACCGGGTTC ACATGATAGT GGCTCATTCA CTTTAAAGGA TCCAGTAAAA 120

TCAGTTTGGG CAAAGACTCA AGATAAAGAT TACCTTACCC AAATGAAGTC GGGAGTCAGG 180

TTTTTTGATA TTAGAGGTAG AGCAAGTGCT GATAATATGA TTTCAGTTCA TCACGGCATG 240

GTTTATTTGC ATCATGAATT AGGAAAATTT CTCGATGATG CTAAATATTA CTTGAGTGCT 300

TATCCAAACG AAACAATTGT GATGTCTATG AAAAAGGACT ACGATAGCGA TTCTAAAGTT 360

ACGAAGACAT TTGAAGAAaT TTTTAGAGAA TATTATTATA ATAACCCGCA ATATCAGAAT 420

CTTTTTtACA CAGGAAGTAA TGCGAATCCT ACTTTAAAAG AAACGAAAGG TAAAATTGTC 480

CTATTCAATA GAATGGGGGG TACGTACATA AAAAGTGGTT ATGGTGCTGA CACGTCAGGT 540

ATTCAATGGG CAGACAATGC GACATTTGAA ACGAAAATTA ATAATGGTAG CTTAAATTTA 600

AAAGTACAAG ATGAGTATAA AGATTACTAT GATAAAAAAG TTGAAGCTGT TAAAAATTTA 660

TTGGCTAAAG CTAAAACGGA TAGTAACAAA GACAATGTAT ATGTGAATTT CTTGAGTGTA 720

GCGTCTGGAG GCAGCGCATT TAATAGTACT TATAACTATG CATCACATAT AAATCCTGAA 780

ATTGCAAAAA CGATTAAAGC AAATGGGAAA GCTAGAACGG GTTGGCTGAT TGTTGACTAT 840

GCAGGATATA CGTGGCCTGG ATATGATGaT ATCGTAAGTG AAATTATAGA TAGTAATAAA 900

TAAGGATTCA ATAATGATAT TAAGACGAGT ATGAAAATAG TTAGATTCTA ATTATTTTCA 960

CTACTCGTTT TTATTTTGAA AATAAGTAAT AATTCAACAA TATTATAAAT TGAACAGATT 1020

GTTTGTGAAA TTTTTGATAA TATTAAAGTG AAAAAGTGTT ATAAATTGAT AAATATATGT 1080

AATTAACAAA AACAAATCAT TTTAAAAAGA AGAGAGTTGT AAGATGATGa AACGATTAAA 1140

CAAATTAGTG TTAGGCATTA TTTTTCTGTT TTTAGTCATT AGTATCACTG CTGGTTGTGG 1200

CATAGGTAAA GAAGCGGAAG TTAAGAAAAG CTTTGAAAAA ACATTGAGTA TGTACCCTAT 1260

TAAAAATCTA GAGGATTTAT ACGATAAGGA AGGCTATCGT GATGATCAGT TTGATAAAAA 1320

TGATAAAGGT ACATGGATTA TAAATTCTGA AATGGTTATT CAACCTAATA ATGAAGATAT 1380

GGTAGCTAAA GGCATGGTTC TATATATGAA TAGAAATACC AAAACAACAA ATGGTTACTA 1440

mTATGTCGAT GTGACTAAGG ACGAGGATGA AGGAAAACCG CACGACAATG AAAAAAGATA 1500
(2) INFORMATION FOR SEQ ID NO: 362:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:





TACATGTTTC GGATGCTACT TTATTTAGTT TGAAGGGTGC ATTATGGACG TTAGCGCAAG 60

AAGTTTATCA AGAATGGTAT TTAGGATCGA AGTTGTATGA AGATGTTGAA AAGAAAATAG 120

CACGAACTAC TTTTAAGACA GGTTATATTT ATCAAGAAAT TATTTTGAGA CCAGTAGATG 180

AAGTTAAGGT ACTTCTGAAT GATTTAAAAG GTGCTGGTTT CGAATTAGGT ATTGCAACAG 240

GTCGTCCTTA TACTGAGACT GTTGTGCCAT TTGAAAATTT AGGATTGTTA CCATATTTTG 300

AAGCTGATTT TATTGCAACA GCAAGTGATG TTTTAGAAGC AGAGAATATG TATCCGCAAG 360

CACGACCATT AGGAAAGCCG AATCCTTTTA GTTATATCGC AGCTTTATAT GGTAATAATC 420

GCGATAAATA TGAATCTTAT ATCAATAAGC AAGATAACAT TGTAAATAAA GATGACGTAT 480

TTATAGTAGG CGATTCGTTA GCTGACTTAT TAAGTGCTCA AAAAATAGGT GCAACGTTTA 540

TTGGAACATT AACAGGTTTA AAAGGTAAGG ATGCTGCAGG TGAGTTAGAA GCGCATCATG 600

CCGACTATGT TATTAATCAT TTAGGTGAAC TTAGAGGTGT ACTAGATAAT TTGTAATTTG 660

ATTGTTGTTT GACAGCATAA CTTGTAGTGA ATGATTGAAC CAAAGGTTTC ATATTGAGTT 720

ACAATGAAAT TAATAATGAA AAAATGCCAA GAAGCAATGG AAGTAATCCA ATGTCTTCTT 780

GGCATTTTGA ATTTACATAA ATTGTTTATG ACTGTACCGT CAATTCAGTT GTGAAAATTT 840

GATTGTATTC ACCAACTTGT TTAAGTTCAT CAATTATATT GTTTGAAACA GGTTGATCAA 900

CGGATAAAAT CATTAGCGCA TCTCCGCCCG CTTCAGTTCT ACCTAAAGTC ATAGATGCAA 960

TGTTGATATT GTATTTACCT AACAATGCGC CAGTTTTTCC TACCATACCT GGAGTATCAT 1020

TATGATATGA CACAATTTGA TATTGATTTG GCTTTAAGTC TACAGAAAAA TTATTAATTC 1080

TAACAATTCT TGGACCGAAA CCTGTAAAGA CAGAAGCGCC AACTTTAACG GAATCGCnAT 1140

CGnTTGATAG TTCTACCTCT AAGTAGTTAC TAAAACCTGT CTCTGCTTTA TTATTTTCAA 1200

TATTTAATGT CACTTG 1216


(2) INFORMATION FOR SEQ ID NO: 363: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 867 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363:

ATCAAAATAT TTTAAATAAT ATTGATGGTC ACATTGTAAA TrmAATAGAA AATAAATTTG 60

ATCAAATATT ACAAGAACCA TTAAATCCAT TAAATTATGA TACTGTCAGT GGATTAGCTG 120

GGATAGGGAG ATATTTGCTA AATAGAGTAG ATGAGAATGA ATTTAATGTT AAAGCATTAA 180

AAAGCATATT AGTATACTTT AAAGATATTC AATATTCTAA AAATAGCTGG GTAGTCCCAC 240

AAGArAGTCA ATTTTTAGAG TCTGATAAAA ATTATTTTAC TGAAGGTAAT ATCAATCTTG 300

GCCTTGCACA TGGaGTGCTA GGACCGATGT CTTTATTTGC ATTATGCGTG ATTAAAGGAA 360

TTACGATTGA AAATCATCAG CACATATTAA AAGACATGTA CAAATTTATC ATGGACGAAA 420

AATTTTGTAA CCACGAAAGA TGGTTGCAGC GTTACGATTT AATTTCTGAA CGTAATCATT 480

TCAATTTTAT TCGGAATGGT TGGTGTTATG GCAATACGGG TGTAATGACG ACGTTGTTTT 540

TAATCGGCCA AGCATTACAA GATGATGAAA TAATTAAAAT GTCTAAAAAA GTGATGCTAC 600

AAGTAGTAAA TGATAAAGAT GAAAATTTAA TAAGTCCAAC TATTTGTCAT GGATTGTCAT 660

CACAAATATT AATGTTAACA ATTATGAATT TGAATTTTGA ATTAAATGAA GTGTCTGATT 720

ATATCACTGT ATTAATAAAT AAACTGATTT CTCATTATAA GGAAGATTAT CTGGTGAATT 780

TTATAGACAT TAATGAAAAT AAGCAAGATG TATTTAAAAG TAGGAAAGTT GGCCTTTTAG 840

AAGGTGAATT AGGGGTCATC TAACATT 867

(2) INFORMATION FOR SEQ ID NO: 364:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10813 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 

TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaATCCtG 60 

ACATCCtTGT ATCaCTATGA CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 120 

AAAATCCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACTTTTA 180 

TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 240

AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCGTTTATTA 300
CAATCGCATG CCGCAGTTAA TTATTATAGT AAAAACCAAT GTACATGGTG GGCATTTAAA 420

CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT AATAGATGGG GCAATGCTAA AAATTGGTAT 480

TACAATGCAC GTAAATCAAA ATATGCGACT GGTCGTACAC CAAGAAAATT TGCTGTCATG 540

CAATCAACTG CAGGATATTA TGGACATGTC GCAGTTGTTG AACAAGTATA TAAAAACGGT 600

AGTATTAAAG TTTCAGAATA CAACTTTTAT CGCCCATTAA AATACAATAC ACGTGTACTA 660

AGCAAAAAGG CAGCACGTAA CTTTAACTAT ATTTACTAAT CAAAAAACTT CTATCACGAA 720

CGCTTCAATT TCCTGTATGC GTGTGATAGA AGTTTTTATT TTATGAAATT ATATTATTAC 780

TTCTACAAAT TTCAAATTGC CGTAATTGAA CGTATATTTC TTCTTCAACT ATTATTTCAT 840

CTTTAGCATA ATCTATATAT AAAATTTTAT GCTATTATTT AAATAATTCG CTATAACTTA 900

ACATACGTTT TCGATATAAA CCTTGTTCTA AATCTCAATA ATTTTTTGCT GTTTTCATCG 960

TCATTAGTTA AAAAAATAAT TTAACTGAGT TTTTGAGACC TGTTTAAATT GATGACTCAA 1020

AACCTTTATC CCCTTTTCAC TCGGTTTAAT TGCTTTAATA TTTAACACAG TCTCATTGTA 1080

ATTTTGCTAC TAAGTTTGAA ATATTTCGAT TCGAATGTGA AcATCATTTA TTATTACTTT 1140

CGACAGCTAG AAAAATATTG TTAAACCAAA ACTATAATTA CCACTTTTAT ATAGAACTAT 1200

ATATAAATTT TAACTTTGAG ACAGAACTGC TAGTCAGGTT TATGAATATA TTTCTTTAGT 1260

TTACTTGaTA TACTTATTGG TAAATCATTA TTTATTTGAA GAGCATTTAT AACAAAAAGT 1320

ATAATCCCAG TTATAGCGAT TCCATGACCA TTTTTATCAG TCAAAGATTC ATCAACAAAA 1380

ACTTTTTGAT AAATTTATAT ATTTGTATAA TTTATTATGG TAGATATCTA CACCCGTATC 1440

TATAACAGCT ACAGTTATCA TTGCGTCTTA TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500

TCATTAATAT CATGGCTATT TGTAACGTCA GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560

TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA TACATATTTT GTATATCGCT GTTAATCTCA 1620

TTTAATTTTT TAAATTTTTT TCTAACCTTT TTCAAAATAA AATGCTCTTT AATTCCATCT 1680

CTTAAATACA TATAAATCTC TATTATAATC TCCATAAATG TCATATCTCC TCTCTCAAAA 1740

AACATTATAC TCAATATATT AAAAATAAGA AAGGTTTTTT CAGAATTTTT AGTATTTTCA 1800

GATATTTATT CTTAAGTATC TTATTATTAC TTCTTTTTAA CGCTAGTGGa ATAATTAATA 1860

ATAATTGTAT TTAAATGCAA TGCTGTTGTT CTTTCATACT TACAAGCAAG TCATAAGAAA 1920

TGAGAAATTA AATTCTTATA TGATGAATAA ATGATGACTG CAGAAGAAAT TGCTAAAAAA 1980

GGAGTGGAGT GAGGCATTCT ACTGTTCATA TTGTTAACAA ATATCAAAAT GATAGGAACT 2040

TGAAGAATCT AAGACTGTTA AAATCGAGAC AAGAAGAAAC TCGTTCCAAA TTTGAATCCT 2100
AAAACTTCTA TCACGAACGC TTCAATTTCC TGTATGCGTG TGATAGAAGT TTTTATTTTT 2220

GTTCATATTA ATTTATCTAA GCGCTACGAT GGaCTGACTT AAACTTTTCT TTAAAATCGC 2280

TATTCGCCAT TTACTATTGT TGTCTAATTT CTTGTAAAAT ATGTTCCGCT GCTTGTGTAT 2340

TTGCACGGGG TTCTTTTTTC AAAGCTTCAG CTACTTTAGC AATTTCATCA CCTTTTGCCC 2400

CTACAACGAT AGCTAATGAT TTATATTGTA AACTCATATG ACCTTGTTGA ATACCTTCTG 2460

ACACAAGCGC GCGACATGCT GCAAAGTTTT GCGCTAAACC AACGGCAGCA ACTACATGAC 2520

CTAATTCTTG TGCTGACTCT ACATTTAGTA GCTCTAATGA AGCTTTAGCA ATTGGTAATA 2580

CTTTTGTACC ACCGCCAACG ATTGCCAATG TCATAGGCAC TTCAATTGTA CCAATCAATC 2640

GTTGACGATC TTGATCGTAA CGCCATGTAG CAATACCACG ATACTGTCCG TCACGACTCG 2700

CGTATGCATG CGCACTTGCT TCTGCACCAC GCGTATCATT TCCTGTTGCT AAAACAACAG 2760

CATGTATGCC ATTCATAACA CCTTTATTAT GTGTTGCTGC ACGATGAATA TCTACTTGGG 2820

CCAATACAGA AGCACGTTCC ATTCGTTTGG CAACCTCTTC TCCAGTTCTC TCGCCCCTTG 2880

CTAAATCTTT AACATCAATT TCGCCTTGAA CTTTAACAAC GGACGCTGTT GCATGATTGG 2940

ATAAAATACT CATTAAAATG TCGCTTTGCG GAAATTCATT TTTTAAAAAT GCAGTTATGG 3000

CCTCTAAAAT CGTATTAAGC ATATTAGCGC CCATAGCATC TTTCGTATCA ACAAATACTT 3060

TTAAAGATAG TAACTGTTGC TCAGGAAATG TATCAATCGC TATACGTTGG TAACCACCAC 3120

CACGCGCTTT AATAGAAGGA TATGCCTCAT CCGCAATTTT ATGAATTTGC TTTTCTAAAG 3180

CTTTAATGTC TGCTGATAAT TTTTCAGTAT CGTCAACGCC ATCAAAGACG ATTTGACCTA 3240

TCATAATACG TTCAGAAGAT ACCGTTTTAA ATCCGCCAGT CTGATTCACT AGCTTTGCAC 3300

CATAACTAGC TGCAGCGACA ACTGAAGGCT CTTCCACCAT CATAGGTACA ACATATGCCT 3360

TATCGTCCAC AATGATATTC GGTAATAATC CAACGGGTAA TGCACCTTGC GCGATGACAT 3420

TTTCAATTAA ACTATTGGCT ACTTCTTCAT CGATTAATGG ATGATTCAGT AAAATGTCGA 3480

ATTGTTCTTC TGATAACCAT TGCTTATCAA CCAATTGTTG TAACTTTTCT TTACGAGATA 3540

AATGTCGAAA ATTCTTATCT AAATTTTGCA TGGACGTACT CCTTTTACTT CACATAATTT 3600

TTAACATTTC AATCACTACT ATTTTTACCA CAAAATAACG TCATTCGTCT TAAAATTCAA 3660

TTGAATAATT GTCGTTTTGA CTTTAAAATA AAACAAGGTA AATTAAAACG CTTACAAGAA 3720

ACGACAAATc ATTTTTAAAT TTAGTATATT TCTTTGTATA AAATTAGCAT ATTCTGATAT 3780

GATACAAGTG TTGCTTTTAT AAATTTGAAA GGATGTAAAA CCTTATGACA ATAGGTATCG 3840

ATAAAATAAA CTTTTACGTT CCAAAGTACT ATGTAGACAT GGCTAAATTA GCAGAAGCAC 3900
CTGTAAACCA AGACATCGTT TCAATGGGCG CTAACGCTGC TAAGGACATT ATAACAGACG 4020

AAGACAAAAA GAAAATTGGT ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 4080

AAGCAGCCGC TGTTCAAATT CACAACTTAT TAGGTATTCA ACCTTTTGCA CGCTGCTTTG 4140

AAATGAAAGA AGCTTGTTAT GCTGCAACAC CAGCAATTCA ATTAGCTAAA GATTATTTAG 4200

CAACTAGACC GAATGAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATGGATTGA 4260

ATTCAGGCGG CGAGCCAACA CAAGGTGCTG GCGCAGTTGC GATGGTTATT GCACATAATC 4320

CAAGCATTTT GGCATTAAAT GAAGATGCTG TTGCTTACAC TGAAGACGTT TATGATTTCT 4380

GGCGTCCAAC TGGACATAAA TATCCATTAG TTGATGGTGC ATTATCTAAA GATGCTTATA 4440

TCCGCTCATT CCAACAAAGC TGGAATGAAT ACGCAAAACG TCAAGGTAAG TCGCTAGCTG 4500

ACTTCGCATC TCTATGCTTC CATGTTCCAT TTAfAAAAAT GGGTAAAAAG GCATTAGAGT 4560

CAATCATTGA TAACGCTGAT GAAACAACTC AAGAGCGTTT ACGTTCAGGA TATGAAGATG 4620

CTGTAGATTA TAACCGTTAT GTCGGTAATA TTTATACTGG ATCATTATAT TTAAGCCTAA 4680

TATCATTACT TGAAAATCGA GATTTACAAG rTClfZTC* A A A r AATCGGTTTA TTCAGTTATG 4740

GCTCAGGTTC AGTTGGTGAA TTTTATAGTG rrs^r att agt tgaaggctac AAAGATCATT 4800

TAGATCAAGC TGCACATAAA GCATTATTAA ATAAfTGTAC TGAAGTATCT GTTGATGCAT 4860

ATGAAACATT CTTCAAACGT TTTGATGACG TTGAATTTGA CGAAGAACAA GATGCTGTTC 4920

ATGAAGATCG TCATATTTTC TACTTATCAA ATATTGAAAA TAACGTTCGT GAATATCACA 4980

GACCAGAGTA GTCGGTGTAT TTAAAACACA TATAATAAAA CCTAAAAGCA GCAGTAAGAC 5040

CACTTCTAAT TGAAATCGTC TTACTGCTGT TCTCTATTTA TAACACTTCG TATTGAATGA 5100

ATTCATTATG CCTATTTGAC ACATTATTGA AGTTTTCCTA ATGCCTGGAT CCTTTATACG 5160

TTACGGCTTC GTGCTATGTT TTGGTACATA AAGCTTTGAC ATATCGATAT TCTCCAACTC 5220

TAACAGCTTA ATTTTATTAT TAATCGTTCC ACCGAACCCT GTTAAGCTAC CCGTTTTACC 5280

GACAACACGA TGACATGGCA CGATAATAGA TAATGGATTA CTTCCGACTG CACCTCCAAC 5340

CGCTTGGGCT GACATTTTTG GCTTGTTAAG CAGCTTGCCT ACrrrI"ITGG CAATAbCAL (_ 5400

ATACGTTGTT AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA 5460

ATGACTACCT GTTGGCTTTA AAGGTATTGT GATTTCAGGA TTGTCACCTT TAAAATACGC 5520

GTCTAACCAC TGTGTCGCCT CTCTAAATAT CGCTAAAGAC GTATTTTCTT CCCTAGTACC 5580

ATCACCTTGT TGATTTTCAA ACAAAACAGC GGTCAGACTT ACCCCATCAC TCAAAAGTTC 5640

CAATCGTCCT ACAGGCGAAT CATAGTAACT CTTATACTCC ATAAAAATTC CCCCTTTTTC 5700
ATAAGTCGTC AATTACGTAT ATAAACACGT AATACCAGCT ATCACTTTGC TGCAATATAC 5820

AGTTACATAT CTTACTACAC GTGCTAACCT CTTACTTTGT AAACCAAATC TTAAATTAAA 5880

ATATTGAAAA TGCAATGAAT CCTTAATATT TTATTAAACC TATAATTACT TATTAAAAAT 5940

AACACACAAT ATTCATAAAG TTTTAAAAAT ATTCTGTTTT ATCACCTACT ATTAGTGGAA 6000

AAGTACAATT GCAATTGTAT ATAGTTTGCA TAACGCTTCA AAAGTAATTT CTTTTTTGTT 6060

TAGTTCAAAA AAATTTAGAG GTGATGTTAT ATGAATAACG GTTTTTTCAA TAGCGACTTT 6120

GATTCAATTT TTCGAAGAAT GATGAAAGAT ATGCAAGGTT CAAATCAAGT CGGAAACAAA 6180

AAGTACTATA TTAATGGTAA AGAAGTTTCA CCTGAAGAAC TAGCGCAACT CACACAACAA 6240

GGTGGCAATC ACTCTGCTGA ACAAAGTGCG CAAgcTTTTC AACAAGCAGC ACAAAGACAA 6300

CAAGGGCAAC AAGGTGGCAA CGGCAATTAT TTAGAACAAA TTGGTCGTAA CCTTACGCAA 6360

GAAGCACGTG ACGGTTTATT AGATCCAGTC ATTGGTCGTG ATAAAGAAAT TCAAGAAACT 6420

GCTGAAGTTT TAAGTAGACG AACTAAAAAC AATCCTATAT TAGTTGGAGA AGCTGGTGTT 6480

GGTAAAACTG CGATTGTTGA AGGTTTAGCA CAGGCAATCG TTGAAGGAAA TGTACCAGCA 6540

GCAATCAAAG ACAAAGAAAT TATTTCTGTA GACATTTCAT CATTAGAAGC TGGAACGCAA 6600

TATCGTGGTG CTTTTGAAGA AAATATTCAA AAATTAATCG AAGGTGTTAA ATCTTCACAA 6660

AATGCCGTAC TATTCTTTGA TGAAATCCAT CAAATTATCG GTTCAGGTGC CACAGGAAGT 6720

GATTCAGGTA GCAAAGGGTT ATCTGATATT TTGAAACCTG CATTAAGTCG TGGTGAGATT 6780

TCTATTATTG GTGCAACAAC ACAAGATGAA TATCGAAACA ATATTCTTAA AGATGCTGCA 6840

TTAACGCGCA GATTTAATGA AGTGCTTGTT AATGAACCAA GCGCTAAAGA TACTGTTGAA 6900

ATTTTAAAAG GTATTCGCGA AAAATTCGAA GAACACCATC AAGTAAAATT ACCAGATGAC 6960

GTATTAAAAG CATGTGTTGA CTTATCAATT CAATATATTC CACAACGATT ATTACCAGAT 7020

AAAGCAATCG ATGTGTTAGA TATTACAGCA GCACATTTAT CTGCGCAAAG TCCAGCTGTC 7080

GATAAAGTTG AAACTGAAAA ACGAATTTCT GAATTAGAAA ATGATAAACG TAAAGCAGTA 7140

AGTGCTGAAG AATATAAAAA AGCTGACGAC ATTCAAAATG AAATCAAATC ATTACAAGAT 7200

AAATTAGAAA ATAGTAATGG TGAACATACT GCTGTTGCTA CAGTTCATGA TATTTCAGAT 7260

ACTATTCAAC GATTAACTGG TATTCCAGTT TCTCAAATGG ATGATAACGA TATTGAACGT 7320

TTAAAAAATA TTTCTAATCG TTTAAGAAGT AAAATCATAG GTCAAGATCA AGCTGTAGAA 7380

ATGGTTTCAC GTGCAATTCG CCGTAATCGT GCTGGGTTTG ATGACGGCAA CCGTCCAATT 7440

GGCAGTTTCC TATTTGTTGG CCCTACTGGT GTTGGTAAAA CAGAGCTTGC TAAACAATTA 7500
GACACAACAG CTGTTTCAAA AATGATTGGT ACAACTGCTG GTTATGTTGG TTATGATGAC 7620

AATTCAAATA CGTTAACTGA AAAAGTACGC CGTAATCCAT ACTCAGTCAT TCTATTTGAT 7680

GAAATCGAAA AAGCAAATCC ACAAATTTTA ACATTGTTAT TACAAGTAAT GGATGATGGT 7740

AATTTGACTG ATGGTCAAGG TAATGTCATC AACTTTAAAA ATACAATTAT TATTTGTACA 7800

TCAAATGCTG GCTTTGGCAA TGGCAATGAC GCTGAAGAAA AAGATATTAT GCACGAAATG 7860

AAAAAATTCT TCCGCCCTGA ATTCCTTAAC CGCTTCAACG GCATCGTTGA ATTCTTACAT 7920

TTAGATAAAG ATGCATTGCA AGATATCGTC AACTTATTAT TAGACGATGT ACAAGTTACA 7980

TTAGACAAAA AAGGTATTAC GATGGACGTT TCTCAAGATG CGAAAGATTG GTTAATTGAA 8040

GAAGGCTATG ATGAAGAATT AGGTGCACGT CCATTAAGAC GTATTGTTGA ACAGCAAGTA 8100

CGTGACAAAA TTACAGATTA CTATTTAGAT CATACAGACG TTAAACATGT GGATATAGAT 8160

GTTGAGGATA ACGAATTAGT CGTAAAAGGT AAATAACGAC ACTTTAACAT ATCGCGCATC 8220

AAAAATGAGC ATCAGGTCGC CCTTGCCTGT GCTCATTTTT TTAATTATTT CCCTGGAAAA 8280

TGATTCGCTG TGTGCTGTTC TGTTCCACAA CAATCACGAT TAATGTCACA TGTACCACAT 8340

TTTCCTTGTT TTGAACGCTT GAAAAATTTT ACTAGTGTAT ATAAGGCATA TCCGAAAATT 8400

GCTAAAAAAA TTAAAATGTT AATAATGACT GACACTTTAA CCACTCCTTA AACAAATAAA 8460

TGTCCGACTT GATAAAAAAT GAATGTTAAG ACATATGCAG TGACTAGAGG ATAGGCAACT 8520

GCAAGTGCCG TCCATTTCCA TGAATAAGTC TCTTTACGGA TTGCTGCTAC TGTAGAAACA 8580

CAAGGAATAT ACAATAGTAT AAATATCATA AATGCATACG CAGATAGCGG TGTGAATTGA 8640

TTTTGAATCA CATTAACAAG GCCTGCATCA CCTGATGAAT AGATAATCGC CATCGAACTT 8700

ACGATAACTT CTTTTGCTAA AAATCCTGGC AcTAACGTAG CACCTGCTTG CCATGTTCCA 8760

AATCCGAGCG GTTGCACTAA CATACCAAAG AAACTACCAA CCATATGTAA AAAACTTTGA 8820

TTGATATTCA CATTGATACC ATGTGGTCCT ACATAACTTA ATAGCCAAAT GACTACTGAG 8880

CCGCCAAAAA TAAATGTACC TGCTTTACGA ACAAAGCCCT TAGCCTTTTC CCAAGTACTA 8940

CGCCACAACG TTTTAATGGA AGGCACACGG TATGTTGGCA ATTCCACAAT AAAGATTGCA 9000

TTATCATTTT TTAAAATCGT CTTAGTAAGT ACTGTACTGA CTAAAAATGC CATAATAATA 9060

CCTAAAACAT ACAGGCTTAA TACTACTAAA GATTGATTCT CTTTGAAAAA GATACCTACG 9120

AACAACGCAT ACACTGGCAG TCTAGCAGAG CATGACATGA ATGGTGCAAT TAATATCGTT 9180

GTTAAACGCT CTTTTTCATT TTCAATACTG CGCGCAGCCA TAATACTCGG TACATTACAA 9240

CCAAATCCGA TAATCATTGG TATAAAAGAC TTCCCGCTTA AACCGAACGA TTCCATAATA 9300
AAAAAGAGCA CAACAATTTG TGGTACAAAG ACTAATACTG ATCCTACACC AGCAATAATG 9420

CCATCTGTAA TTAAATCTTG TAAAAATGGT ATAACACCAA GATAATTCAT AATCGTCTTC 9480

ACACTATCTG TAAATGTACC ACCTATAAAT GCATCGAGTT GATCCGACAA AGGTGTGCCA 9540

ATCCATGTAA ATGTAGTTTG AAAGATCAAC CACATAATTG CTAGAAAGAT AGGCATCCCT 9600

ATATATTTAT GTGTTAATAT CTTGTCTATT CTAGAGCTGA AATATTGCTT ATCTTCATCT 9660

GGATACGTTA CCACGTCTTG CAATAACGTC TCAATATAAT GATTGCGTAT ACGCTCCATC 9720

TCTCGACGAA CAGATACAGC CCCTACTTGT TCAGCAACTT GATCACGTAA ACTCGACAAT 9780

TTATTTACAA CCTCTGAATT AAGTTCGTTT GCAATTTGCA TGTTATTTAA TAAGAATTGA 9840

ATCGCAATAA ACCTAGCTTG ATACTTATCA TGAGATGTCT CTGTCATTAT TATTTGACAC 9900

ATATTTTTAA TTGTCTCTTC AATCTTCTCA CCATAATTGA TTTTAAAATG CGGTTGATAC 9960

CCTTCCCCTA GATGCTTTAT TTCGCCAAGT AAATATTTTG TTCCTTTGCC TGTACGTGCC 10020

ACAACTGGAA AAATAGGTGT TTTTAACTTT TTCATCAATT TATGATAATC GATTTTTATC 10080

CCGCGCTTTG TAGCTACATC AATCATATTT AATCCGATGT ATATTGGTTG ATTAAGTTCT 10140

AACAATTGTA CTGTTAATTG CATATTTCTT TTTAGTTGAC TCGCATCAAC AATGTTAATG 10200

ATTCCTGAAA ATGAATCGTT TAATAAATAG TCTGTCACTA CAGTTTCATC TTTAGAAATC 10260

GGCGATAAAT CATATGTACC TGGTAAATCA ATTAATTGTC CTACATTTTC TTTAAGTTTC 10320

CCTACTTTTT TCTCTACCGT TACGCCACTC CAGTTGCCTA TATATTCATA CGAACCAGTT 10380

AAAGCGTTAA ACAAAGATGT TTTACCAACA TTAGGATTTC CTAAAATACA ATAATTTTCC 10440

ATTCGTCCGG CTCCTATTCT TCTAATGCAA TAGAACAAGC ATCGCAATGT CTAATACTTA 10500

ACTGTTGTCC GTTTACTTCA ATAATACATG GCCCTTTAAA TAAACATTTT TGTTTAATCG 10560

TTATGATAGC GTCATCTGTT AACCCAAAGG CACTTAGACG ATACAACATA TTCTCATTAG 10620

CAATATCCAT TCGCTTTATT TTATAAGCCT TATTCATTTC ACCATTTTTA ATGTTTAACA 10680

TACTATTTCG CTCTCCTATT AGAAATAATA ATCATTATCA CTTAAAAATC ATAACCCTTA 10740

AAATTGTAGC TCGCAATACT TTATTTAAAT AATTTTCATT TTTCATGTAA AATTTGTGAC 10800

ATTGCAAAAA TGT 10813

(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

TGATGGATTA GCAGACATTT TACGAGCGAA TGGTTTCAAA GTGTTTGGTC CAAATAAGCA 60

AGCAGCTCAA ATCGAAGGCT CAAAATTATT TGCTAAAAAG ATAATGGAAA AATATAATAT 120

TCCAACTGCT GATTATAAAG AAGTTGAGCG AAAAAAGGAT GCTTTAACAT ATATTGAAAA 180

CTGTGAATTG CCCGTTGTTG TCAAGAAAGA TGGGTTAGCT GCTGGGAAAG GCGTTATTAT 240

TGCAGATACT ATTGAAGCAG CCAGAAGTGC TATTGAGATT ATGTATGGTG ATGAAGAAGA 300

AGGTACTGTT GTATTTGAAA CGTTTTTAGA AGGTGAAGAG TTCTCGCTAA TGACATTTGT 360

TAATGGTGAT TTAGCAGTAC CTTTCGACTG TATTGCACAA GATCATAAAC GCGCATTTGA 420

TCATGATGAA GGACCAAATA CTGGTGGTAT GGGGGCTTAT TGTCCmgTAC CACATATTAG 480

TGACGATGTT TTAAmACTTA CAAATGAAAC AATTGCACAw CCCATTGCAA AGGCAATGCT 540

TAATGAAGGT TATCAATTCT TCGGTGTATT ATACATTGGT GCTATTTTAA CTAAAGATGG 600

TCCAAAAGTA ATAGAATTTA ATGCCCGTTT TGGTGATCCT GAAGCTCAAG TATTATTAAG 660

TCGCATGGAA AGTGATTTAA TGCAGCATAT TATTGATTTA GATGAAGGAA AACGTACTGA 720

ATTCAAATGG AAAAATGAAT CTATTGTAGG GGTCATGTTG GCATCAAAAG GATATCCTGA 780

TGCATATGAA AAAGGGCATA AAGTAAGTGG CTTTGATTTA AATGAAAACT ATTTTGTTAG 840

TGGATTAAAG AAGCAAGGTG ATACCTTTGT TACTTcAGGT GGTAGAGTTA TACTTGCCAT 900

CGGAAAAGGT GACAATGTAC AAGATGCACA GCGAGACGCA TACAAAAAAG TATCACAAAT 960

ACAAAGTGAC CATTTATTCT ATCGTCATGA CATTGCGAAT AAAGCACTAC AACTTAAATA 1020

AGTAAATTTA AAATACTAAG aTTAGCTATG AACGAATCTA TAACGATAGA TTTTTTCATA 1080

GCTTTTTTAG TTGTAGAGTC TAGGACATTG ATTTCTGTAC CAAATTTGTG ATTATGCATA 1140

TGTAATACAA AAGAGGCGCC ACAACATGTT TGGATGAACA AAATAACATG TTTGTGGCAC 1200

CTCTTTTGTT TAGTATGGAA TAAATGGTTT TCTTTTTCTA TACAATGAAT TTCTAATTTA 1260

GTATCTATAC AATTATGGAT AAAATTTAAC CTACACGACC AAGACGAACA TCATCTATGC 1320

CCGTGATGGG TAAGGTGATT GAACAATAAT ATGCCATAGT AATAATGGCA ATTAAAACTA 1380

TAATAAAGAT TATATCTTTA TATGAGAAAG GTACGTTGTA ATAGTAAGTA CGAGGACCAT 1440

CTCTAAATCC TTTCGACTCC ATCGCAACTG ATAATTGATG TGCCTTTCTA ATATTTTGGC 1500

TTAATAGAGG TATAATTAAA TGCTTAAATC GCTTTAACCC TCTATAATTT GCCGCGTCTA 1560

TCATCTGATA GCGCATTTTT AAAGATCTGC GAAgcTGTAt TAAAGAACTA ATCATTAAAG 1620

GTATCATACG AATGGCAGCC ATGAATGCAT AAGCAACTTT TGATTTAACC TTTAAATGTT 1680



ATGAAATAGC AATGGTTCTT AATGATACAT GTAAACCACG AACTAAACTT TCTGTTGTAA 1800

TATGGATAAA TCCGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATGAACA 1860

GGGAAGAGAG TAATGCAAAG CCAATACTTA TAGTTATAAA AATTGCTGTA ATTTTAAACT 1920

GAGTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTGATATAA AGCATAAAAT 1980

CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGTTTCGTTA 2040

TAATGTTGAC ATCATCAACA AATGATTGCC GAACTTTCCA TTGCTCATAC ATTCGTATCA 2100

CCATCACAAT CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 2160

ATTATTTCAT CGTCATGTGT AaCCATGACA ATACTTTGTC CCAAATTAAT TCGCTTTTGG 2220

AAAAGTTTGA TCAACTGGAA TGTATTATGG CTATCAAGTC CAAATGTCGG TTCATCTAAA 2280

AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAgcTACGC TAAGGCGTCG TTTTTGACCA 2340

ATAGACAACT CATAAGGATG TTGATCTTTT ACATTTTGTA AATCTAAAAG TTTTAAAAGT 2400

TGTATCGTTT CATCATCACT TTGATCTTTA GAAAGGTGAT TAAAATGAAT GTTAATTTCA 2460

TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTTTT GATAAACTAG GTACATGTGT 2520

TTTGCTGCAT GTTTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 2580

TTAATCAATT GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTGTA 2640

ATTGTAATCC ACTCACCTAG ACCAATTTCT AAATCTGAGA ATGAGAGCAA TGTTGATTTA 2700

CCGCGAATAA TACGTCCATT TTTAAATTGT AATAAGTGTG AGTTTGTTGT TGGAAAGTCA 2760

ACACGACTTG GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 2820

TGAACATACT TCTGTAATAT GATTTCAGGA CATTCATCGG CAATGATATT TCCGTTATAA 2880

TCCATCAAAA TGACGCGGTC GACATGATTC CAGATGTGTT TAACTTTATG TTCAACGATT 2940

ACAACCGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 3000

ACATCTAACA TTGCTGTCGG TTCATCTAAA AACAATGTTT TTGATTGTTG AAGAATGGTT 3060

TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTTTGATATA CGTTTCAGGG 3120

GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACGCATCCAT GTCTTCACGT 3180

GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTCGT ATACTTTTGG CATACAAAAC 3240

TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 3300

TTCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AAGTGTACTT 3360

TTACCGCAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCTT TGTCTTGAAT AGTGATATTT 3420

AAATGATCGA AAATTTTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACTTTT 3480
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ATTTTGTTAC GCCTGTCTTA TCTAAAGCTT TTACTAAAAG GTAAGATAGG ACGCCGGCGA 3 6 00 

CTACTGCACC ACTAATTAAT CTAAATACGA TGAATAATGT TAAGTTCCAA CCTGCAACTT 3 66 0 

5 

CATTTAAATA ACCATAGAAA TAATCTATCG GGAAAGCCGC GATTGCTGTA CAAAAACCTG 3 72 0 

CTAACATAGC TACCATAACT GAACGTGATT GATATTTAAA AATTGCAAAG ACAAGTTCAC 3 7 80 

ACGCTAAACC TTGTATAAAA GCGTAAACGA TTGTCGGAAT ATCGAAACGA CCCATAATGA 3 84 0 

10 

TAGTTTCGCC GGCACCTGCA GCAAATTCAG CCAGTAAAGC AATACCTGGT TTTGGAATAA 3 900 

TTAGATAGCA GACAATCGCT GCCATGAACC AAACCCCGTT TGTTAATTGT TCGAGGTGAA 3 960 

GGCCTGTAGC TTGCACACCA TTGTAAACAA ACCACCATAA ATTGTAAATA ACTGCGAATA 4 02 0 

IS 

CTACTGAAAT AAGTACGGTT ACTAGTATTT CAGATAGCTT TAAACCTTTT GACATTTTTA 4 080 

CATCCTCCTA ATAAAAAAAC GCACAACCAT CCATAGGAAA GTTATGCGTT CACAATATAT 414 0 

20 ATTAGTAAAA CATATGTATA GTAACACTTT CCTACGCTAG TTCAAGCTAG ATCAGGTTCA 4 200 

AAGGGTTTGA GGGCAAGCCT CATCTCAGTA TAAAACACCC CTAGTGTGTG CGATTTATTT 4 260 

AATTAATTAT ACTGTAAGAC GTTTGTAAAC TTATGT CAAT AGGTTGTCTT CATGAAATTT 4 32 0 

25 CGTTTAATTC GATTTAAAAT TTATAATATT AGCATTGGAT TTAAATTGAA GATGTAGTAG 4 3 80 

GAATGTTAGT AATTAAAGAT ATAAAAATAT GTGACATGTA ATAATATTGA GCTGATAAAT 444 0 

GAAGAGGGAT ACTTATCAAT CATACCTCTT TAACAACAGT GAAGAACCCG TGCATAATGG 4 50 0 


CTTACGAATT ATAGTTTATA AGGAAGAAGA GGGATACATG CGCCGAGCAC ATGCATAAAA 4 56 0 

GCCCCTAACA ACTAAAAGTT GTAAGGAAGG AGAGGGATAC ATGCGCCGAG CACATGCATA 4 620 

AAATCCCCTA ACAACTAAAA GTTGTAAGGG GATTTAAATT AATTTAGTGT ATCTTGGATA 4 68 0 


TCTTGTTTTG kTTGaTTAAT ATCTTCTGTT TTTTCTTCTT TTTTATCTTT TAATTTTTCT 4 74 0 

TCAACTTCTT TAGCTTTTTC TGCTGCTTTT TTATTTTGAT TTT CATTAG A CATGATTAAT 4 800 

TCCTCCCAAA TTGGATAATT ATTTATATAT AAATCTTACC CGGTTGTACT TTCGTTAAAC 4 86 0 


TTTTCTAAGT CTATAGCACT ATTTATTCAT TTATCTAAAG ACAACAACAT TAGATTAATA 4 920 

TATAATGATT TTGAGGTGAA CATAATGTCT TTTCTTAGGA AACACGCCGA AATTATTTTT 4 980 

AGCTATTTAA TCGGTmwCGT TcACTCTTCA CTGGkcTCAT TATTTTAATT AACTTGCCAT 5040

TAATTAAACA ATTAAATGGT GGTAAAAAAG TTGATACACA TGTTCATAAT GTGTGGGAAT 5100 

TTCTGAATGC ATTTTTCAGT GAAATTATTA AAGTAATGAG TCGATTTATA GGTAATTTCC 5160 

CtATAGTTAG TGCAATTGTG ATAATTATAT TCGGTATTTT AGTTATGTTG ATTGGTCATA 5220

CATTACTTAG AACTATTAAG TATGACTATG ATATTTCTAT CTTTTTCTTA GTTATCGGTA 5280

TTTTCATTAT TCCATTTACA ATTCATATAG GATATATCGT CTATAAAGAT GAATTGAATC 5400

AGGAAAATGT AAAAAATCAT TTCATGTGGA TAATTGTGAG TTATGGTATA AGTTACTTAA 5460

TTACACAAAT TGCATTGTAT GGCAGAATTG ATGCTAATGA AATAGAGTCA ATTGATATCT 5520

TAAGTGTCAA TGCTTTCTTT ATAATTATGT GGTTACTTGG TCAAATGGCT ATTTGGAATT 5 580 

TCTTGTTCTT GCGCCGAGCT TTACCTTTAA CAAAGCAAGA ATTAGGTGAA GAGGAGCCAG 564 0 


AATTATCAAG AACAAGTAAA GGGAATGTCA CGAATCAAAC TAAAATTCAC TTGAAACAAC 5700 

TCCAAGATAA GACTACAGAA TATGCACGTA AGACAAGAAG AAGTGTCGAT TTAGATAAAA 5760 

TTAGAGCTAA AAGAGATAAA TTCAAAAAGA AAGTTAATGA TATTATCGAT ATTCAAGAAG 5 820 


ACGATATTCC TGATTGGATG AGAAAACCGA AATGGGTTAA ACCAATGTAT GTCGAACTAT 5 88 0 

TTTGTGGTGT CGTCATCTTT TTATTCACAT TTTTAGAATT TAATAATCGT AATGCATTAT 594 0 

TTGTATCTGG TGATTGGAAA TTATCACAGA CACAATATGT TATTGAATGG GTTACATTAT 6000

TAATTCTGTT ATTCATTATT ATCGCATATA TCGCTACAAC GTTAACTTTC CACTTGAAAG 606 0 

GTAAGTTTTA TTATTTACAA TTATTTATGG GGAGCATTTT ATTCTTTAAA TTGTTAACGG 6120 

AATTTATAAA TATAATGATT CATGGACTAT TACTTTCAGT GTTCATTACG CCAACATTAC 6180

TATTAATGTT ATTGGCAATC ATCATTTCTT ATTCGTTACA ATTACGAGAG CGAC CAT AAT 6 24 0 

TAAAAGCATT ATAAAAGTAC TATCTATTAA ACATTTTGAT GTGTACGCTA TAAGTTAGAT 6300 


ATATCTCTAA CTTACTTAGA TACAGGTCAA TGAAGTTTAT GGATAGTACT TTTTTTGTAA 6360 

CTAGATTTGA TTGATTCAGG TGATGTGAAT TAAGTATTGA r AATTG T ATA CAAAGTTTTA 6420 

GTGCAAATAA AATAGTTGAA AAGTTATCCA TTTGTAAAAT CAAGAAAACT AGTAAATAGT 64 80 


TGAAGCGACT TATGGaATTT GCGAAACGAT ATATAGTATT TCCTTTGTAG AAATTTmACA 654 0 

TATATCATTC AAATTACTAA TTTGTTAAAA TCAACAGTAA GATTAGAAGT AGATGATATT 660 0 

GAAATTTGGC AAACAaTtTA ATCTATATAA AACTACAACG AAACACAGAA AGGAAGTTGT 666 0 


CAGATGAAAA TAGCAACTCT GAACAAAGGC AAAGAAACAA AATATTTTAA TGGATATCCT 672 0 

TTAATTGAAG AAGAGGATAT CTATTCACAA GATCATTTAA AAGAAGGAGA TATTTTTCAA 6780 

ATTGTGACTG ATAAATCACA ATAT 6804

(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

aaaAGAGACG CATTaAAACA AGCTATACAA ATT AT CG AT A AATTAACATG GGGTG TTTAG 6 0 

TGGTGGTTAA AGAAATTTTG AG ACT ATT AT TCTTACTAGC GATGTATGAG CTAGGTAAGT 12 0 

ATGTAACTGA GCAAGTATAT ATTATGATGA CGGCTAATGA TGATGTAGAG GCGCCAAGTG 18 0 

ACTTTGAAAA AATCAGAGCT GAAGTTTCAT GGTAATAGCT ATTATCATTT TTGAATTAAT 24 0 

TATATTAATG TGTGTAGCAA TAGCACTGGA GGTGTTGTAA AT ATG TGG AT TGTCATTTCA 3 00 

ATTGTTTTAT CTATATnTTT ATTGATCTTG TTAAGTAGCA TTTCTCATAA GATGAAAACC 360 

ATAGAAGCAT TGGAGTATAT GAATGCTTAT CTTTTCAAGC AGTTAGTAAA AAATAATGGT 4 20 

GTTGAAGGTT TAGAAGATTA TGAAAATGAA GTTGAACGAA TTAGAAAAAG ATTCAAAAGC 4 80 

TAAAGAGAGG CGTTGGCTTC TCTGCTCTAT CyAAAATAAT GAAAGG AG CC saACATGTTA 54 0 

GaCmAAGtCA CTCAAATAGA AACAATTAAA TATGATCGTG ATGTCTCATA TTCTTATGCT 600 

GCTAGTCGTT TATCTACACA TTGGACTAAT CACAATATGG CTTGGTCTGA CTTTATGCAG 660 

AAGCTAGCAC AAACAGTTAG AACTAAAGAA GATTTAACTG AGTACAATAA AATGTCTAAG 720 

TCTGAACAAG CCGATATAAA AGATGTTGGC GGATTTGTCG GTGGATATTT AAAAGAAGGC 780

AAACGGCGTG CTGGTCAAGT CATGAATCGT TCAATGCTAA CACTTGATAT CGATTATGCA 84 0 

GCCCAAGATA TGACTGACAT ATTATCTATG TTTTATGATT TTGCATATTG TTTATATTCA 9 00 

ACACATAAGC ATAGAGAGAT AAGTCCAAGA CTGCGTTTAG TGATTCCTTT AAAACGAAAT 960 

GTAAATGCAG ATGAGTATGA AGCTATTGGG CGTAAAGTCG CAGATATCGT TGGCATGGAT 102 0 

TACTTCGATG ATACAACTTA TCAACCACAT AGGTTAATGT ATTGGCCTTC AACTAGTAAC 1080 

GATGCGGAAT TTTTCTTTAC CTATGAAGAT TTACCTTTGT TAGACCCAGA TAAAATATTA 114 0 

AATGAATATG TTGATTGGAC TGACACATTA GAATGGCCAA CGTCTTCAAG GGAAGAGAGT 12 00 

AAGACTAAAA G ATT AG C AG A TAAGCAAGGC GACCCAGAAG AAAAGCCGGG AATTGTTGGT 1260 

GCATTTTGTA GAGCCTATAC GATAGAAGAA GCTATAGAAA CTTTTATTCC TGATTTATAC 1320 

GAAAAACATT CTACTAACCG TTATACCTAT CATGAAGGTT CAACTGCAGG TGGATTGGTG 13 80 

TTATACGAAA ATAACAAGTT TGCCTATTCT CATCATAATA CGGATCCCGT AAGCGGTATG 1440

CTTGTGAACA GTTTTGATTT AGTACGCATA CACTTATATG GTGCTCAAGA TGAAGAAACT 1500 

AAAACAGATA CTCCGGTTAA TCGACTACCT AGTTATAAAG CAATGCAGCa AAGAGCGCAA 156 0 

AATGATGAGG TTGTTAAAAA GCAATTAATT AATGACAAAA TGTCTGATGC AATGCAGGaT 1620

TTCGATGAAn GAGAAAATAG CGATGATGCA TGGTCTGAGA CGTTnGAAAT TACTTCGAAA 1680

(2) INFORMATION FOR SEQ ID NO: 367:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

AAGATTAGGC ATCAAGACAG GGTCGCGATT GTTTGAAATC CCACATAGnA ATGATATTTa 6 0 

CaTTATCAAT CCaAGTATGC GTAAATATCT TAATGTTTCA GTTGCTATTT CTAAGATTGC 120 

ATTGCGTTAT ATTCCACCTG AAGATTTACA CCAATATAGT ATTGACGAAT TTTTTATGGA 180 

TGTTACTGAT AG CT AT CAT A GATTTAGTTC TACAGTACAT GCATTTTGCG AAAGACTTAA 24 0 

ACGTGAAATT TATGAAGAAA CAGGCATTTA TTGTACTGTG GGCATTGGTT CTAATATGTT 300 

ATTAAGTAAA ATTGCTATGG ATGTTGAAGC GAacATAGTc AAAATGGTAT AGCTGAATGG 360 

CGATATCaAG ATGTACCAAC GAAATTATGG CCAATTCmGC CCCTGCGAGA TTTTTGGGGT 42 0 

ATTAATCGTC GAACAGAAGC CAAATTGAAT AAAAGAGGAA TTTTTACTAT AGGAGATTTA 4 80 

GCGAAATATC CATATAAATT TTTAAAAAAA GAGTTCGGTA TTTTAGGTGT TGATATGCAT 54 0 

CTACATGCGA ATGGGATAGA TCAGAGTAAA GTACGTGAAA AGCACAAGAT CAGCAATCCA 600 

TCGATATGCA AAAGTCAAAT ATTAATGAGA GATTATCATT TTGATGAAGC AAAAGTAGTA 66 0 

ATGCAAGAGT TAATTGAAGA TGTTGCTAGC AGAGTTCGAG CAAGAAAAAA AGTGGCAAGA 72 0 

ACGATACATT TTGCCTTTGG CTATAGTGAT GAAGGCGGTG TACATAAGCA ATATACTTTG 78 0 

AAAGATCCAA CAAACTTAGA AAAAGATATT TATAAAGTAG TAATGCATTT CGCAGATAAA 84 0 

TTATGTAATA AACAAGCACT ATATCGTACG CTAAGTATAT CTTTGAG TCA ATTTATTAAT 90 0 

G AGGATG AG C GACgTTAAGT CTGTTTGAAG ATGAATACCA ACGCAAACGT GACGAATGTC 96 0 

TAGCTAAAAC GATAGACCAA TTACATTTGA AATACGGCAA AGGTATTGTG TCCAAAGCAG 102 0 

TATCGTTTAC AGAAGCAGGT ACAAAACACG GCAGATTAGG TTTAATGGCT GGACATAAAA 10 8 0 

TGTAATGACT ATACGGTTTA AGTAATATAT AACTGTGATT CGTATAAAAT AAGTCTCTAA 114 0 

AGATAAATAT TTCATATATC ACAATAGATT TTCACAATAA TATCTAAGAA TACATGGAAT 1200 

TT AT CAAAAG AGACTTAATA ATTATTGGAT ATAACAATCA AAATCACTCA ATGCTTGCAT 126 0 

ACCGCGTTCT CGGTCAGTAG GGTTTTTGAA ACTAATTTTT AAAGCACCGT ATATATCTTC 13 20 

GCGTACTTCT AAGATTCTTA AGTTGCTTAT AGATATGTTA TGTAAACTCA GGATATAAGT 1380

TAGTCCACCT AGTTGTTTAG CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 1500

TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTt TCTAAATCTT GAAACTGACT 156 0 


TTTTAGCTGT CGAATCATTT CTAAAATATA CGTTTTATTA CTCAAGGTGA TATCTTTCCA 1620 

CATTTGTGCA TTACTACTAG CTATACGAGT GATATCACGA AAACCACCAG CTGCAAGTTT 1680 

ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 174 0 


GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTCG TGTTCTTCAG CAGTAGTTAC 180 0 

AATAAATTTA GCAAGAGTAG GTGATAACAG TTCTTTTAAC GTGTTTG 184 7 

(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 494 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:

AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 60

AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 120

TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 180

CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 240

TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 300

TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 360

TCACTATCTA TTAAATAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 420

GTGTACCATT TTCCATCCTC TTCAATGACA ATGTTGCCAA TATCAAATTC AAGGACCTTG 480

CCGTCCTCTG AAGT 494
(2) INFORMATION FOR SEQ ID NO: 369: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2518 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:

GTCCATATTT CCCGTCCAAC CAACTAAAAA TGGGTAAATC CCTGGATTTa AATCTACTTC 60

GATAATTTCn AATGTAGCGA CCATCATTAC GAGATGATTT GATATAAGCA CAGTTTGGAT 180

GTTGACCAAT ACTATCGCCT TCTTCTTCGA TGATATCTAT TTTAATACCA TCATCAGCTG 240

CAATTTCTAA TGAAGATTTA ATTCGGTTAT CAAATGTTGA ATATCCCATT GCTCCACCCA 300

CAATAGCGAC ATCTGTACCA TGTCCTTGGT GTGTTTGAGC AAATGATTCA TAATAATGTA 3 60 

TTTCAATATT TTTAGGAATA TCTCCCAATA TTGCGCGTGC TGAATTCCCA ATCTTTACTG 420 


CACCAGCCGT ATGAGAACTT GAAGGGCCCA TCATAACTGG TCCGATAATA TCGAAAGCAC 4 80 

TTTGATAATC ATAGCTCTTT GCCATAATTA AACACTCTCC TTAATATGAT TCTTTTTGCT 54 0 

CGGCATTTTA AAGTTGATAT TCATTAAATT AAACTTATTA ATTAGTGTTC CAATAATATA 6 00 


GGCTAAGACA ATGCTGACAA TAATCACTGT TGCGATTGTT ACAATTGATG TCACTGCATT 660 

ATTAAAG CCA AACAATACGA TGGCGCCTGC AATTGGTGTT GCCATACCTT TGACACCTAT 720 

TACTAGTCCG CTAAATGTCA CGATACATGC GTTGACGACC CCAATCAGTG CATTTGTACC 780

ATATAGTTGT ACTGGATATT GCGCTATTAA ATCAATTTGC GTCAATGGCT CAATACAAAC 840

TGCAAATGCT TTTGACGGTC CACCAATGTT TAATTTTCGG AATAAAATAA GGTTAACAAA 900

TGAGCTACCT GTACATGTTA GTGCTCCAAT AGCCATAGGA ACACCTGTCA GTCCTAATAA 960

ACTTGTTAAT ACCATTGAAC TTAGCGGTGT CATACCTGTA ACAGGAATCA CTAGTCCTAA 1020

AATGACCGCT AATGCATATG GATTGTTATC ACCTACCGCA GTGACAGCAC TACCTATTTG 1080

TTTTAATGTT GCTAGCACAC CAGGTGTAAT GATTGATGCA AGTCCGAAAG CAATTGCTGG 1140

TGCAAATAAG ATCACCACAA TTAAGTCCAA GCCTTCTGGA ACTTTCTTTT CAATCCATTT 12 0 0 

AATTAAAAAA GcTACGCCAT AAGCTGCGAT GAATGCTGGT AATAATTTAA AGTCATGTAA 1260 


TACTAAACCA ACAATGACCG CAAATACTGG TGCAACGCCT AAGTTTAAGC ACGTTAGAAT 13 20 

ACCTACTGcG ATACCGCTTA AACTTCCTGC TAAATCCCCA ATAT CTTGTA GAAATTTAAT 13 80 

ATCAAATACG CCACCAATAG CATAACTTAA GAATGCTTGT GGTAGAAATG TCGCACAAGC 144 0 


TGCACCGGAT AATGCTTGTA GTCCTTGTTT ACCGTACGGT GCATACTTTA AAAATAGCGT 150 0 

CATGATCACT AAAACCAAGA CTAATGTGCC TACACCTAAC AGAATATCCA TTTCCCAAAA 156 0 

ACCTCTCTCT ATGTTTATTT TATTTTCAGA CCATAAACAT CGTACACCCT TAAGAAAACG 1620

TTTTCAACTT TTATCTGTTA TCAAATCaAA TATTTAAGTG AAATATTTCA TATTTGTGAA 16 80 

AGATTTTTAA AATGGATTGT TTCAAAAACT ACTTATGTTG TCGTTAATAT TTACTAATTA 174 0 

ACTTTTTACT CTATATTTCA AACAGTTGTG TGACAGTTTT TTGATAACTT TTTTACATCT 1800

GAAAGTAAGT AATATTTCTA AAAACTTTTA ATATTTATAC ACTTTATCTT TCGAGCTATT 1860

ACACATATAT TTGCAATAAG ATAATTAAAG TAGGATATTA TTTTTAGTTT TCTGATAGGA 1980

ATGATGATAG TCTATAGGTT GAATCTTTAC TTTTTTTTAA AGCTAAATTT ACATCAACTT 2040

AACAATGGTT GGTTATACTG AAGATGAAAA TATTTAGTAT AACTTAGTGG AGGCGATAAA 2100

GGTGCAATTG AGTCATTCCG TTAAAGTTGC AATTTCTATC TATTTAGCAC TTATCTTTAT 2160 

AACGTTCACT TCTTATTTAG TCATTATTTT ATATACGAGT ATGACTGGAC ATGATGTATC 222 0 


ACATTTCGTG TTAGATAGTC AG CATTCT CA TCATGGATCT CTTACGCAAA AACATTTGAG 22 80 

TCTTCCTGAA ATCTCATTTA AATAGTTTAT CCTCTGTGTT TCAACATTCA T r rTCCCATAT 2 34 0 

CGATTCATTT ATCTATCATC TAGACCACTA CATCTTAGAT GATTTTTTTA TTTTCTCATT 24 00 


TCACTCTTTC TTAAAAGT CG ATATAATGAA TTAAATCATT AT CAT ACA C C GACATATTTT 24 60 

ATGTTGTTGG TGTTAAGTTT aAAGGGGTGA GATACTTGGC GAATaATCaT TCAGCTTT 2518 
(2) INFORMATION FOR SEQ ID NO: 370:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 790 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:

ATACTACTGG ACCGTCnACC AATTGACAAA ACCTGAAAAC TGGATTTTTA TTGCAGGATT 60

ACATCGAACC TGGAAGAACA ATCCCCAGGA GTTGATGATA TTCGAGATAT GGCATACAAT 120

CAAGGTAGTT TAGATAAGAC AATTTATGAA ATTTCTAAAC GCAGAGTACT ATTTTTAATA 180

CAGAAAGATA TTACGGTATA TAATAAGACG ATTGACTGTT TAAATTATTA TAACTATAGT 240

GACGAAAGAA TAAAGGATGA TTAAATGAAT TCACAAGAAT TATTAGCAAT TGCTGTGGAT 300

GCAATTGACA ATACCCCAGG CGAAGATACG ATTTCTTTAG AAATGAAAGG TATCAGCGAT 360

ATGACAGATT ATTTTGTTGT AACGCACGGA AATAATGAAC GACAAGTTCA AGCGATTGCT 420

AGAGCGGTGA AAGAAGTAGC CAATGAACAA AATATAGAAG TAAAACGTAT GGAAGGATAC 480

AATGAAGCGC GTTGGATATT AATTGACTTA GCTGATGTTG TGGTACATGT TTTCCATAAA 540

GACGAAAGAA ATTATTATAA TATTGAAAAG TTATATCAAG ATGCACCATT AGAATCATAT 600

AGTCAGGTTG CGTATTAATT ATGTCGCAAT ATGCAGAAAT GAGCCTAGTG TACGATCAAT 660

TGACTCAAGA TCAACCATAT GAAAAATGGT TTGAAATTGT AAAAAATCAC TGCAAAGATG 720

AATCAAATAT TTTAGATATT GGATGCGGTA CTGGTAGTTa ACAGTTCAAT TAGAAGCTTT 780

(2) INFORMATION FOR SEQ ID NO: 371:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1823 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:

ATAGATGAAG GTGCAAATAT TGAAaTAGGT TATTTACCTG GACGCTTGAA ATGGTTAGTT 6 0 

GCTGATTTAT TAACTAAACA AGGATTAAAA GTAGTTAACG ACGATATGAC AGGAAGAACG 120 


TTAAAAGATC GTAAATTATT AACAGGTGAC AGTCCTTTAG CTTCAAATGA GTTAGGAAAA 18 0 

TTAGCAGTTA ATGAAATGTT AAATGCAATA CAAAATAAAT AATTAAATAT TAATTAGAGG 24 0 

AGCCTCATAT GTAAATGTAT GAGGGCTCTT TTTTTTGGCA AAATTTAAGT GATACTTGTA 300

AAATAGAACC TATTATGAGT ATGATTTAAG AAAACGCTTG CAAAACTAAT AACCGCAACT 360

AGCGATATGG AGGAAACATG ATGTCTTATA GCATTGGAAT TGATTATGGA ACTGCTTCAG 420

GCCGTGTGTT TTTAATTAAT ACAACTAACG GTCAAGTAGT ATCAAAATTT GTGAAACCAT 480

ATACACATGG TGTCATTGAG AGTGAATTAA ATGGTTTGAA AATACCACAT ACATATGCAC 540

TTCAAAATAG TAATGATTAT tTAGAAATTA TGGAAGAAGG AATATCATAT ATAGTACGTG 6 00 

AATCAAAAAT AGATCCAGAC AATATAGTAG GTATTGGTAT AGACTTTACT TCATCTACTA 660 

TTATTTTTAC TGACGAAAAC CTTAACCCGG TACATAACTT AAAACAATTT AAAAACAATC 720 

CACATGCGTA TGTGAAACTT TGG AAA CATC ATGGTGCATA TAAAGAGGCA GAGAAATTAT 780 


ATCAAACTGC TATTGAAAAT AATAATAAGT GGTTAGGCCA TTATGGATAT AATGTT AG T A 84 0 

GTGAATGGAT GATTCCCAAA ATAATGGAGG TCATGAATCG AGCAC CAGAA ATTATGGAAA 900 

AAACGGCTTA TATTATGGAA GCGGGCGATT GGATTGTAAA TAAATTAACT AATAAAAATG 960 


TACGCTCGAA TTGTGGATTA GGTTTCAAAG CATTTTGGGA AGAAGAAACA GGGTTTCATT 1020 

ATGATTTATT TGATAAAATA GACCCCAAAT TAT CAAAAG T AATTCAAGAT AAAGTATCTG 1080 

CACCGGTTGT TAATATTGGT GAAGCAGTAG GGAAACTGGA TGATAAAATG GCACAGAAAT 1140

TAGGATTATC AAAAGAAACT ATGGTAAGTC CTTTTATTAT TGATGCCCAT GCTAGTTTAT 1200

TAGGTATTGG GTCTGAAAAA GATAAAGAAA TGACTATGGT GATGGGAACA AGCACATGCC 1260

ATCTTATGTT AAATGAAAAG CAACATCAAG TGCCAGGTAT ATCAGGTTCT GTAAAAGGAG 1320

CAATTATTCC AGAATTATTT GCTTATGAAG CGGGGCAATC AGCAGTAGGT GATTTGTTTG 1380

CTGTATTTGA ATTAATGAAT GAAAAGATAA AACATCAAAT GCCAGGTGAA AGTGGGCTCA 1500 

TTGCTCTTGA TTGG CAT AAT GGAAATCGAA GTGTATTAAG TGATAGCAAT TTAACAGGTT 1560 

GTATCTTTGG ATTAACTTTA CAAACTAAGC ATGAGGaTAT TT AT AG AG Cm TATTTaGaAG 162 0 

CTACaGCATT TGGTACtAAG ATGATTATGC mACAGTATCA AGATTGGCAT ATGGaAGTAG 1680 

aAAAGGtATT TGCaTGTGGc gGTAt ACCt A AAAAGAATGC TGTTATGATG GATATCTATG 1740 

CGAATGTACT GAATAAAAAA CTAATTGTTA TGGATAGTGA GTATGCACCA GCAATAGGCG 1800 

CAGCAATATT AGGTGCAGTC AGT 182 3 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

ACGATCATCT GCATCAGCGA ATTCCGATGC AaTTcATATG tCCtAATAAC AAAAGAATAG 60

GTTTTaAAAG ATATGCATAC CGTAATGaTA GATATAGTTT TAAACGTGAC TTCAAGCTAT 120

ATGaATGTGA TGACTGTTCA TCATGTTCTT TGAGACATCA ATGCATGAAG CCAAATTCGA 180

AATCCAATAA GAAAATTATG AAGAATTATA ATTGGGAATA CTTTAAAGCC CAAATTAATC 240

AAAAGCTTTC TGAACCAGAA ACGAAAAAAT CTATAGTCAA AGAAAAATTG ATGTAGAGCC 300 

TGTTTTTGGA TTTATGAAGG CTATTTTGGG TtTCACTCGA ATGTCAGTTC GGGAATAAAT 360 


AAAGTTAAAC GAGAGCTAGG TTTTGTATTA ATGG CACTTA ATATAAGGAA AATAgcaGCT 420 

CAACGAGCTG TACATTATAA AATACATATC AAAAAAGCTG ATTTCTATCA AATAAATAAT 4 80 

AGAAATCAGC TTTTTTACAT TGCCTAAGAA CTTTAAGGAA CTTAATGTCC CAAGCTCTTT 54 0 


TTTGTTATAT CTAATTCGTA ATTTATGATT GTTTATTCGG TCCTTTGATG TTCACTAAAT 600 

GTGACTTTAA ATCTTGTTCT AATTGTTGTA ATTCTTTTTC AG CTAATTGT CGTTCTTCGC 66 0 

GACCGTGTTG TTGAATAATT AATGTTTCTT CAATTGTCTC AATAATGTTA CGCTGTGTAC 72 0 


GTTTCAATGT AT CAAG AT CA ACAATGCCAC GCTCATTTTC TGTTGCAGTT TCAATCGCAT 78 0 

TTTGTTTCAA CATTTCAGCA TTTGCTGTTA ATAAATCATT AGTTGTATCA GTGACAGCTC 840

GTTGTGCAGC AACTGCATTA CGCTGTCTCA TTAATGTAAG CGCAATGGCC ATTTGATTTT 900

TCCATAGTGG AATACTTGTC AAAATTGAAC TTTGTATCTT CTCGGCAAGT GCTTGATTAA 960

AGTCATATAT GCGTTTATCT AGTCTATCTA TAAATTGCTG CATATCTGCA ACTTGTTGAA 1080

TATCCATTTG ATTAGTGGAt TgtGCGCTTG CTGTTGCAAT TGTGGTAGCT TTTCATTTTC 114 0 

TAATTGCAAC TTTTTTTGcT GTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG 1200

TTTGTTTTTA TCATATAGCG TATCTAATAA TTCAATATCT C TTGTTAAAT GTGTTTGATG 126 0 

TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 1320 


GATTTCATTG ATAGACGACT TTGCTCTGCT AAAAATTCTT TTT AA CAT AG ATGGTTTATC 13 8 0 

AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGATCTG ACAAAGTATC 144 0 

TCCAATAGGA CCAACATCTT TACTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 150 0 


CATTTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 156 0 

TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 1600 
(2) INFORMATION FOR SEQ ID NO: 373:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1227 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373:

ATGTTGATAA TGGTAGCTTC TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG 6 0 

aTAGTAAAAC TGGTGGTTTA GTTGCTATAT CTGGTGGACG TGATTTCAAA GACGTCGTTA 12 0 

ACAGAAACCA AGCAACAGAT CCTCACCCTA CTGGTTCATC TTTAAAACCT TTCTTAGCGT 180 


ATGGACCTGC CATTGAAAAT ATGAAATGGG CAACAAACCA TGCGATTCAA GATGAATCTT 24 0 

CATATCAAGT TGATGGTTCT ACATTTAGAA ACTATGATAC GAAGAGTCAC GGTACTGTAT 3 00 

CTATTTATGA TGCTTTACGA CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAAT CAG 36 0 


TTAAGCAAAA TGCTGGTAAT GATGCACCTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 4 20 

ACGAAGGCGA TATTGGTCCA TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 4 80 

CACAATTAGC ATCAGCATTT GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 540

CAATTCAAAA AGTAGTTACT CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 600

AAGCGATGAG TGATTACACT GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 660

CATATGGTTC TGCATATGGC CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 720

CTGGTACTTA CGGTGCTGAA ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 780

AAGTTAAACA ATATGGTGAA AACTCATTTG TrGGACATAG CCAACAAGAA TATCCACAGT 900 

TCTTATATGA AAATGTGATG TCAAAAATTT CAT CTAG AG A TGGCGAAGAC TTTAAACGTC 960 

CTAGCTCAGT AAGTGGTAGT ATCCCATCAA TCAATGTTTC TGGTAGTCAA GATAACAACA 102 0 

CTACAAATCG TAGTACACAC GGTGGTAGTG ACACATCAGC AAACAGCAGT GGTACTGCAC 1080 

AATCAAATAA CAATACTAGA TCTCAACAAT CTAGAAACAG CGGTGGATTA ACAGGTATAT 114 0 

TCAACTAATC CACTCAACAT AAAATCCTCA GTTATACCAT ATTTATGGTG TAGCCGAGGA 12 00 

TTTTnTTAGG TTCTTCATCT TTTATGG 12 27 
(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CCATATGGtG CAACATTCTT CGTATTTAGT GATTATTTAA AACCAGCGTT ACGTTTATCA 60 

TCAATTATGG GATTAAATgC aACGTTCATC TTCACACATG AyTCaATTGC AGTAGGTGAA 120 

GATGGTCCTA CTCATGAACC AATTGAGCAA TTAGCTGGAT TAAGAGCCAT TCCAAATATG 180 

AATGTTATCC GTCCTGCTGA TGGTAATGAA ACAAGAGTAG CATGGGAAGT TGCCTTAGAA 240 

TCTGAATCTA CACCTACTTC ATTAGTATTG ACACGTCAAA ACTTACCGGT AT T AG ATGTA 3 00 

CCAGAAGATG TAGTTGAAGA AGGCGTTCGA AAAGGTGCCT ATACAGTTTA TGGCTCTGAA 3 60 

GAGACACCAG AATTCCTATT ATTAGCTTCA GGTTCAGAAG TTAGTCTTGC AGTTGAAGCT 4 20 

GCTAAAG AT C TTGAAAAACA AGGTAAATCA GTACGTGTTG TTTCAATGCC TAACTGGAAT 4 80 

GCATTTGAAC AACAATCTGA AGAATATAAA GAAT CAGTTA TTCCATCAAG CGTAACAAAA 54 0 

CGTGTTGCGA TTGAAATGGC TTCACCGCTT GGATGGCATA AATATGTAGG TACTGCAGGT 6 00 

AAAGTTATTG CTATTGACGG CTTTGGCGCA AGTGCACCTG GCGATTTAGT AGTTGAAAAA 6 60 

TATGGATTTA CAAAAGAAAA TATCTTAAAC CAAGTTATGA GCTTATAAGA ATAATTTATA 720 

AAGCGAGTAT GTTTAGAAGT CTAGGATGCA TAATCTTAGG CTTCTTTTAA AGTGTTGAAA 780

TTTAGAGTAT AGCACTTAAA CTACATCATA AGTGATAAGT TATGAAAGTA TACTATTTCA 840

GATTAATCTT TAAAAGCTCT GTTATAACAG CATGATTTTT GATATTATTT TTAGTATCGA 900

TATTAAAATA CTTGAATAAA CTAGTTCTTG AAATAATGTG ATGAATTTAG TAAAATTCAG 960

TGCATTAATT TTAGGTTTAA TTGGAGGTTT CCTTTTAGCT AGAAAATATA TGATGGACTA 1080

CTTGAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATGATGATGA TGCAAATGGG 114 0 

TCAAAAACCT TCTCAGAAGA AAATTAATCA AATGATGACG ATGATGAATA AAAATATGGA 1200

TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA 126 0 

TGGAAATGGC CTCTTTTTAT AATCAAATTA ATAAGAATAA ATATGTTTAT TAAAATTAAA 132 0 


GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATTTTGTATG ATTTACAGAG 13 80 

AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTTTCAATC ATTGTAGCTT 144 0 

TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT CCGGTAAAkG 1500 


AGAAAAAAAT AGTTTTGCCa CCgTTTTTTA tGGCgACCGG TGCATTGATG TACGTCGTTC 1560 

CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAGCCTT T AT AATTGG T TTGCTTTTTT 1620 

CtACAGTTCT AATTTGGACT TCTCGATTTG AAGTCAAAGG TACAGAAATT TATATGAAAC 1680

GATCTAAAGC ATTTCCAGTT ATTTTGATTT CATTACTTAT CATTCGTACT GTGATGAAAA 1740

TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATGTTCTTT TTATTAGCAT 1800

TCTGTATGAT TGTTCCTTGG AGAGCAGCAA TGCTATATAA ATACAAAAAA CTAAAGAAAA 1860

CATTAATCAA TTAATTACTT TTaAAACCAC TTGTGATCGA CTTCTAAATC AGTCAATGAG 192 0 

TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 19 53 

(2) INFORMATION FOR SEQ ID NO: 375:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3787 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375:

ACATTTGATC AAnTTATCGA CATTAAAGAT GAATTCAnTT GATCGTnTCA ATGATTATCC 60

TGTTGAAGTA GCACGTTTGC TTGATATAGT GGAmATaAAA GTACACGCAT TACATTCAGG 120

TATCcACGTT GATTAAAGAT AAAGGGAAAA TAATTGATAT TCATTTATCT GTAAAAGCCA 180

CTGAAAATAT TGATGGCGAA GTGCTGTTCA AAGCAACACA ACCTTTAGGT AGAACAATGA 240

AGGTTGGTGT TCAAAATAAT GCAATGrCAA TTACTTTAAC GAAACAAAAT CAATGGCTTG 300

ATAGTTTGAA GTTTTTAGTT AAGTGCATTG AAGAAAGTAT GAGAATCAGT GATGAAGCAT 360

AAAGAAGCAT TTAATGGCGT TGTCGTGTTA ACTGCTGCAT TAATTGTCAT TAAAATTCTG 420

CAACAAGTGT ATCCAATTGT AGCATTAGGA ATGATATTAT CGATGAATGC CATTCCTAGT 540

GCAATTACAC AAAATATAGG GAAGTATCAT AGTGACGAAG CATATGCAAA AGCAGTCGCT 600

TATATACAAT TAGTTGGTAT ATTATTATTT ATTGCTATTT TTGTGTTTGC GAACAATATT 660

GCACATATGA TGGGTGATGG CCATTTAACA CCAATGATTC AAGCTGCAAG TTTAAGCTTT 720

ATATTTATAG GTATGCTTGG CGTGTTAAGA GGTTATTATC AATCTGCAAA TAATATGACA 780

GTTCCGGCTA TTTCCCAGGT TATAGAACAA GTTATACGAG TAGGTATTAT CATTGTTACT 840

ATTGTTATTT TTGTAGACAG AGGTTGGACG ATATATGAAG CGGGAACAAT TGCTATTTTA 900

GCATCAACGA TAGGTTTTTT AGGTTCTTCA ATTTATTTAG TAGCGCACCG ACCTTTTAAG 960

TTTAAAATGG TAAATAACAC TGCAAAGATC GTTTGGAAAC AGTTCGCACT TTCGGTTTTG 1020

ATTTTCGCTA TCAGTCAATT AATCGTAATT TTATGGCAAG TGATTGATAG TGTTACTATT 1080

ATTAAGTCAC TTCAAGCGAT ACGCGTGCCA TTCGATGTTG CCATAACTGA AAAAGGAGTC 1140

TATGACCGTG GTGCATCATT TATTCAGATG GGATTGATTG TAACTACAAC ATTTAGTTTT 1200

GCGCTCATTC CTCTGTTAAG TGACGCAATC AAAATGAATA ATCAGGTACT TATGAATCGT 1260

TATGCAAATG CGTCATTAAA GATTACGATT TTAATAAGTA CAGCAGCGGG AATAGGATTA 1320

ATTAATTTAT TGCCTTTAAT GAACGGTGTG TTTTTTAAGA CGAATGATTT AACCTTAACG 1380

TTAAGTGTTT ATATGATTAC GGTCATTTGT GTATCGTTAA TTATGATGGA TATGGCATTA 1440

TTACAAGCGC AACATGCTGT GAGACCTATT TTTGTTGGTA TGACGGCAGG ATTGGTTATT 1500

AAATTTATAC TTAATATCAT TTTGATTCGT TTAAGTGGCA TTATTGGTGC GAGCATTAGT 1560

ACTGTTGTAT CATTAATTAT ATTCGGTACG ATTATCCATA TTGCTGTCAC GAGAAAATAC 1620

CACTTATATG CGATGAGACG ATTTTTTATC AATGTTGTTT TAGGTATGGT ATTTATGTCG 1680

ATTGTTGTTC AATGCGTGTT AAACATAGTG ACAACACACG GTAGAATCAC TGGACTCATT 1740

GAATTATTAT GTGCAGCAGT ATTAGGTATC ATTGCATTGT TTTTCTATAT TTTTAGATTT 1800

AATGTTTTGA CATATAAAGA GTTAACTTAT TTACCATTTG GTTCAAAGTT GTATCAAATT 1860

AAGAAAGGAA GACGTTGATG GCACATACCA TTACGATTGT TGGCTTAGGA AACTATGGCA 1920

TTGATGATTT GCCGCTAGGG ATATATAAAT TTTTAAAGAC ACAAGATAAA GTTTATGCAA 1980

GAACGTTAGA TCATCCAGTT ATAGAATCAT TGCAAGATGA ATTAACATTT CAGAGTTTTG 2040

ACCATGTTTA TGAAGCACAT AACCAATTTG AAGATGTCTA TATTGATATT GTGGCGCAAT 2100

TGGTTGAAGC TGCTAATGAA AAAGATATTG TCTATGCGGT TCCGGGTCAT CCTAGAGTTG 2160

CTGAGACAAC TACAGTGAAA TTACTGGCTT TAGCAAAGGA CAATACTGAT ATAGATGTGA 2220

ATGATGGCTT CACACTGTTA GATGCGACAT CATTACAAGA AGTAACACTT AATGTTAGAA 234 0 

CGCATACATT GATTACGCAA GTTTATAGTG CAATGGTTGC TGCTAATTTG AAAATCACTT 24 0 0 

TAATGGAACG ATATCCTGAT GATTACCCTG TTCAAATTGT CACTGGTGCA CGAAGCGATG 24 6 0 

GTGCGGATAA CGTTGTGACA TGCCCATTAT ATGAATTGGA TCATGATGAA AATGCATTCA 252 0 

ATAATTTGAC GAGTGTATTC GTACCAAAAA TCATAACATC GACATATTTG TATCATGACT 2580 

TTGATTTTGC AACGGAAGTG ATTGATACTT TAGTTGATGA AGATAAAGGT TGTCCATGGG 264 0 

ATAAAGTGCA AACGCaTGmA AcgCTAAAGC GTTATTTACT TGAAGAAACA TTTGAATTGT 270 0 

TCGAAGCTAT TGACAATGAA GATGATTGGC ATATGATTGA AGAACTAGGA GATATTTTAT 276 0 

TACAAGTGTT ATTGCATACT AG TATTGG TA AAAAAGAAGG GTATATCGAC ATTAAAGAAG 2 82 0 

TGATTACAAG TCTTAATGCT AAAATGATTC GTAGACACCC ACACATATTT GGTGATGCCA 288 0 

ATGCTGAAAC TATCGATGAC TTAAAAGAAA TTTGGTCTAA GGCGAAAGAT GCTGAAGGTA 2 94 0 

AACAGCCAAG AGTTAAATTT G AAAAAG TAT TTGCAGAGCA TTTTT TAAAT TTATATGAGA 3 000 

AGACGAAGGA TAAGTCATTT GATGAGGCCG CGTTAAAGCA GTGGCTAGAA AAAGGGGAGA 306 0 

GTAATACATG AGATTAGATA AATATTTAAA AGTATCACGG TTAATAAAGC GACGTACGCT 312 0 

AGCAAAAGAA GTAAGTGATC AAGGTAGAAT TACAA TAAAT GGTAATGTTG CTAAAGCTGG 318 0 

ATCGGATGTT AAAGTTGAAG ATGTGCTGAC GATTCGCTTT GGTCAAAAAT TAGTAACAGT 3 24 0 

TAAAGTAACT GCATTAAATG AACATGCATC TAAAGATAAC GCGAAGGGTA TGTATGAAAT 3 3 00 

CATTGAAGAG CGTCGACTTG AAGAAG CGTA AATTGGAGGT GACAAG CAAT GAAAAATAAA 336 0 

GTAGAACATA TAGAAAATCA GTACACGTCG CAAGAGAACA AGAAAAAACA ACGTCAAAAA 34 2 0 

ATGAAAATGC GTGTTGTTCg TAGGCGTATT ACAGTATTTG CGGGCGTATT aCTTGCGATA 3480

ATTGTTGTTT TATCaATCTT GCTTGTTGTC CAAAAACATC GCAATGATAT TGATG CACAG 354 0 

GAGCGAAAAG CGAAAGAAGC ACAGTTTCAA AAGCAACAAA ATGAAGAAAT TGCGTTAAAA 36 0 0 

GAAAAGTTGA ATAATCTGAA TGACAAAGAT TACATTGAAA AAATTGCGCG TGATGATTAT 3 66 0 

TACTTAAGCA ACAAAGGTGA AGTGATTTTT AGGTTGCCAG AAGACAAAGA TTCGTCTAGC 372 0 

TCAAAATCTT CGAAAAAATA AATCCAAATT GATTCAAAAT TATCCGAGTA TAGACATTGT 3 78 0 

GAAAAAA 3 78 7 
(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376:








pi j j /— »t»"> a r*r~> 
LAnAL 1 LjALtvj 


A A pp Pr-i A A P A 
AAV- vi L. IlAAvj A 


LVj\JO/V».y All 


PPAPATT^TAG 


AAGTATTTAA 


60 


AAA 1 1 o 1 




A t p. a p A T A A P 
A 1 vjALA 1 nHL 


P/TPAPTTTAT 

VJ 1 V-r\V— A A 1 1 


A PTTTTAAG C 


AACCTGTATC 


120 


ALCAv_AGL, 1 1 


LrL-Al 1 lAnAft 


TP 7A PAP ATP A 


A A TTTTTPTS 


A ATAAP.PAPP 


HTH T ATT AG A 


180 


T AAGG 1 AAAA 


G TTTTAG ATA 


TA PP R a TTTP TA 

AGGAA1 i ILtA 


TTTTITPTT a 


A TTP. A PP.PTP 


PTPPPPGAAT 

V, A O V_J O VJ U/i/1 X 


240 


TG C CG T AC CA 


A T" A T A TP A A P 

ATATATGAAG 


G 1 A GAGA 1 GA 


1 1 1 v- 1 Av»A 1 


APT A A APATP 


TAATPAATGA 

1 /Vrt 1 ^/VrA A VJ/A 


3 00 


TTGTGCAGAT 


TGTGTCATCA 


Gl G 1 G 1 1GGG 


a tpi a a a tt a 
Ai L-AAAAl 1 A 


an tp r*T a tt a 

VjVj luL Inl 1A 


PPPATPPP AT 
vjV_vjM 1 vjLI — r\ I 


3 60 


TGTTCACCAA 


GATTATGTTA 


ATCAG AATG T 


A I GGGCGAG 1 


A A ' 1 "l'T'1 ™|«T» A A 

AA1 1111 1AA 


t a a tp a a tpp. 

1 AA 1 ijrtn 1 v- L 


4 2 0 


CTATACAGAC 


» ATA TTP 

AGCTATATTG 


AAAAAPAPAA 

aaaaagalaa 


TPn a aTP a HC 
I LAM1 LjAL-Lj 


aTTPr.fi A A AT 


TA APA A AT A A 


4 30 


AACAGTCTAT 


ACATTTGAAG 


AACATGCCAC 


GTA I GAAAA 1 


TTr'TP A C* A A P 

1 1 G 1 C ALj AAvj 


p a ttttt a a A 

Ln 11111 /Vrvrt 


Jt u 


ACAATTAATA 


GGAGTTAAAA 


ATG AATT A CA 


GAGAALAAG 1 


T TV AAPAAAAa 

1 AAAGAAAAA 


P B PTPBP A AT 
bnL 1 L^/Avj-MJ-V 1 


6 0 0 


ATGTTTGGCA 


TCCATTTACA 


CAAATGGGTG 


TATnTftPPTi a 

I A I A I AGGAA 


AvjAAvj/\MvjL_rt 


ATP ATP ATTP. 
<rt 1 L_/\ 1 Lrt. 1 1 \J 


660 


AAAAAGGAAA 


GGGTAGTTAC 


CTTTACGATA 


/"> TV TV T'/*' /~* -v TV 

CGAAIGGCaLA 


T A A ATA TTT A 

i AAA 1 A 1 1 1 A 


P A TP.PTT ATP 
uM 1 LjO 1 lAlu 


7 o n 


CATCGTTGTG 


GGTCAATGTG 


CATGGT CAT A 


TVT'TVTV^'JV A Ti *"P TV 

A 1 AAGAAA 1 A 


PTTm A ATA BP 
LI 1 liAA 1 AAvj 


PTA ATT A A A A 
Vj 1 /\A 1 1 I\*\r\r\ 


7 80 


AGCAACTCAA 


TAAAATTGCC 


CATTCTACGC 


I GC 1 AGGA I G 


A TP A A RT&TT 
Ai GAAAI A I 1 


PPPTP A AT AP 
v_l— vj 1 Lnn 1 /\Lj 


O *± VJ 


AACTTGCGGA 


AAAATTAATC 


GAAAI CACGL 


LAAli 1 AA i. G 1 


a apaaaamTa 

AAoAAAALs 1 A 


TTTT A TT PTP 
ill 1 A 1 ILlo 


900 


ATAC AGG C AG 


TGCGTCTGTT 


P A A TV TfTPTi 7A 

GAAA 1 L.Lfv_AA 


TRRRP A TPPP 
1 AAAvjft 1 bVjL 


ATA TP APT AT 


TPPAAAAATA 


960 


TTGATAGAGA 


AAAA I A I GG G 


AAvjAAAAAL-A 


APTTTATA Ar* 
ALj 1 1 Inl nn\* 


P.PTAAATPAP 


PtGTT AT CATG 


1020 


GGGATACGAT 


1 GG 1GGGG 1 A 


TV P TPTTPjT TP 
AG 1 G 1 A 1 L? 




PTTTPAT AA A 
LI 1 1 L-ttlrtrtrt 


AT ATTT A A AC 


1080 


ALI IAAIAI I 


r-nj-> 7\ O T\ ATA TT 

IGAGAAaAI i 


pa aPTap a a a 


(T"*PP ftTPTTT 
VjLLLAI v-1 1 1 


PTATPP,PAPtT 

V_ A rt A LVJV-rtVJ A 


AATTACGATA 


114 0 


PTP 7\ A J\ A TP" 1 » 

CTGAAAATGA 


ft TV TP A TP A P A 

AA 1 G A 1 GAGA 


/-tm tv 1 1 m I m i w 1 » TV » 

L»L, 1 Ai 1 1 1 AA 


PP. A ATATAP.A 


OPJLAATTCTA 


ATTGAAAGAA 


1200 


ATGATGAAAT CGCAGGGTTT ATATTGGAAC CGTTGATTCA AGGTGCGACA GGCTTGTTTG 1260

TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 1320

TAATTTGTGA TGAGGTAGCA GTTGGTTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 1380

ATGAAGATGT TCAACCGGAT ATTATGTGTT TAGGTAAGGC GATTACTGGT GGCTACTTAC 1440

CACTTGCAGC TACATTGACA TCTAAAAAAA TATACAATGC ATTTTTAAGT GATTCGCATG 1500

GTGTGAATAC CTTTTTCCAT GGTCaTACAT ACaCCGGAAA TCAAATcGTT TGTaCGGTTG 1560

cATTaGaAAA TATaAGaCTT TATGaAAAAC GTaAGTtnAT TGTgCACATa TTGaAACGaC 1620

(2) INFORMATION FOR SEQ ID NO: 377:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

ATGATTTTtA aAAATCATTA AGTTAAGGTr GATACACATC TTGTCATATG ATCAAATGGT 60 

TTCGCCAAAA ATCAATAATC AGACAACAAA ATGTGCGAAC TCGATATTTT ACACGACTCT 12 0 

15 

CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC 180 

CACGCmTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC TTGGCCGTAA CCTGCCAACC 24 0 

20 AAAGCGAGAA CAAAACATAA CATCAAACGA ATCGACCGAT TGTTAGGTAA TCGTCACCTC 3 00 

CACAAAG AG C GACTCGCTGT ATACCGTTGG CATGCTAGCT TTATCTGTTC GGGCAATACG 3 60 

aTGCCCATTG TACTTGTTGA CTGGTCTGat ATCcGTGAGC AAAAACGGCT TATGGTnTTG 42 0 

CGAGCTTCAG T 431

(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2006 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378:

TTTnTATAAC GTATTATAAA TCGTTAAAAA TTTTGGTTGT GTTTGCGTCA CGTAGACAAC 6 0 

CTCCATAAAG TTACTTAATC ACTCTCATCA TACAATAATT TTTACTCAAA TTGGAAnAAT 120 

40 

TATAAAAATT AAATATAGAT AGGCTTTGAA AATTAGTTTT ATACAAGGTT AGTAGCTGTA 180 

ACTGTAAAAT GTTCTTAATA TTGTCAAAAT GTAATGCTTG AAAGCGCTTT TAAAaAATAT 24 0 

TATTATATAC ATGGTTAGAC AAATAGACAA ATCACTATAC AAATATTGGG AGGAATATTT 300

TATGAAATCA ACACCACACA TTAAACCAAT GAATGACGTC GAAATTGCAG AAACGGTTCT 360 

ATTGC CAGG A GATCCGTTAA GAGCTAAGTT CATTGCAGAA ACTTATTTGG ATGATGTGGA 42 0 

50 ACAGTT CAAT ACAGTGCGAA ACATGTTTGG TTTTACCGGA ACATATAAAG GTAAAAAAGT 4 80 

TTCTGTCATG GGTTCAGGTA TGkGTATGCC ATCTATTGGC ATTTACTCTT ATGAATTAAT 54 0 




EP0 786 519 A2 



CATTGATTTA TATGATGTGA TTaTTkCACA AGGTGCCTCT ACTGATTCAA ATTACGTTCA 660 

ACAATATCAA TTACCAGGTC ATTTTGCGCC AATTGCTTCT TATCAATTAT T AG AAAAAG C 72 0 

5 AGTTGAAACA GCACGTGACA AAGGTGTACG TCATCATGTA GGTAATGTGT TATCAAGTGA 780 

TATTTTCTAT AACGCGGATA CAACAGCGAG TGAACGTTGG ATGCGTATGG GTATTTTAGG 840 

TGTAGAAATG GAATCaGCTG CaTTATACAT GaATGCAaTT TACGCTGGTG TCGAAGCATT 90 0 

10 AGGTGTGTTC ACAGTGAGCG ATCATTTAAT TCATGAAACG TCAACAAcAC CTGAGGAAAG 960 

GGAACGTGCA TTTaCAGATA TGATTGAAAT TGCACTGTCA TTGGTGTAGA TGATTATGAA 1020 

TGTTGAATAT TCTAAAATAA AGAAAGCAGT ACCTATTTTA TTATTCTTAT TTGTATTCAG 10 80 

15 

TTTGGTTATA GACAACTCAT TTAAATTGAT TTCTGTAGCC ATTGCTGATG ACTTAAACAT 1140 

ATCTGTAACG ACAGTAAGTT GGCAAGCGAC ATTAGCCGGT TTAG TAATTG GTATTGGCGC 1200 

TGTAGTATAC GCTTCATTAT CTGATGCCAT TAGTATACGC ACACTATTTA TTTATGGCGT 126 0 

20 

GATATTAATC ATTATCGGAT CAATTATTGG TTACATTTTC CAACATCAAT TCCCATTACT 132 0 

TTTAGTTGGA CG T ATT ATT C AAACTGCCGG TTTAGCTGCT GCAGAGACAT TATATGTGAT 13 80 

ATATGTTGCA AAGTATCTTT CTAAAGAGGA CCAGAAGACT TACCTTGGCT TAAGTACGAG 144 0 

CAGTTATTCC TTGTCATTAG T TAT CGGTAC ATTATCAGGT GGATTTATTT CTACGTATTT 150 0 

ACACTGGACA AATATGTTTT TAATTG CATT AATCGTAGTA TTTACGTTGC CATTCCTATT 156 0 

30 TAAATTATTA CCAAAAGAAA ATAATACGAA TAAAGCTCAT TTAGATTTTG TTGGCTTAAT 1620 

TCTAGTGGCA ACTATTGCTA CAACAGTCAT GCTGTTTATT ACGAACTTTA ATTGGTTATA 16 80 

TATGATTGGT GCCTTAATTG CGATTATCGT TTTTGCGCTA TATATTAAAA ATGCGCAACG 174 0 

35 TCCATTAGTA AATAAATCAT TTTTCCAAAA TAAACGTTAT GCTTCATTTT TATTTATAGT 18 00 

ATTTGTAATG TATGCTATCC AATTGGGTTA TATTTTTACG TTCCCATTCA TAATGGAGCA 1860

AATTTATCAT CTGcAACTAG ACACAACATC ACTGTTATTA GTACCGGGgT T a TAT AG TAG 192 0 

40 CAGTCATTGT TGGtGgCACT AAGTGGgTtA AAATCGGgCG rAATATCTGG AATTCCAAAA 1980 

CCAAGCGGAT TATCACAGCC AATTAA 2 0 06 
(2) INFORMATION FOR SEQ ID NO: 379: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

50 
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TGGGGAAAAA AAGACCCAGC AGTATTAGAA GAATCGTTAA ATATTTCTAT TGAAGAAATG go 

AATCGTATCA TAAAATTAGT CGAAGAATTA CTTGAATTGA CTAAAGGAGA TGTAAATGAC 12 0 

5 

ATTTCTTCTG AAGCACAGAC CGTGCATATT AATGATGAAA TTCGCTCGCG AATACACTCA 180 

TTAAAACAAT TGCATCCTGA TTATCAATTT GATACGGATC TGACATCTAA AAATCTAGAA 24 0 

ATTAAAATGA AACCTCATCA ATTCGAACAA TTATTTTTAA TCTTTATTGA TAATGCAATC 300 

10 

AAATATGATG TGAAGAATAA GAAAATTAAA GTTAAGACAA GGTTAAAAAA TAAGCAAAAA 3 60 

ATAATTGAAA TTACAGATCA TGGAATTGGT ATTCCAGAGG AAGATCAAGA TTTCATTTTT 420 

GATCGCTTTT ATCgAGTGGA TAAATCTCGT TCAAGAAGTC AAGGCGGTAA TGGACTCGGA 480

TTATCTATTG CTCAAAAAAT CATTCAATTA AACGGAGGAT CGATTAAAAT TAAAAGTGAA 54 0 

ATTAACAAAG GAACAACGTT TAAAATCATA TTTTAATCAT GACTGAGACG TCAATCAAAG 6 00 

20 TCATAGGATC AATTTTTTAA GTACACATTA GCTGTGACTA ATGTATAAGA ACAACTATAA 660 

AACAAATAAA CAGTGGTTCT TTATCATTTC TGTTGTACTC CCAAAATTTA CAATAAAATA 720 

CATC TAT AAA CCTAGAAGAA TCAACGCTTT TGTTGATTCT TCTTTTTAGC AGATAAATAG 7 80 

25 

GTAAATCTAC TTTAACAAAT AACTAAATAG TGATATTATT ACATTGTAAG CGTTTCAACA 84 0 

TTTTTGTGGA GGGTGTAAAA TGACTAACGA AAGAAAAGAA GTTTCAGAGG CTCCTGTAAA 90 0 

CTTCGGTGCG AATTTAGGTC TAATGTTAGA TCTATATGAT GACTTTTTAC AAGATCCATC 96 0 

30 

ATCTGTACCA GAAGATTTAC AAGTCTTATT CAGCACAATT AAGAATGATG ACTCAATTGT 102 0 

ACCAGCTTTA AAAAGTACAA GTAGTCAAAA TAGCGACGGC ACAATTAAGC GTGTCATGCG 1080 

35 TTTAATTGAT AATATTCGCC AATACGGGCA TCTTAAAGCC GATATTTATC CTGTAAATCC 114 0 

TCCAAAAAGG AAACATGTAC CTAAATTAGA GATTGAAGAC TTTGATTTAG ATCAACAGAC 12 00 

TTTGGAAGGT ATATCAGCAG GAATTGTTTC AGATCACTTT GCCGACATTT ATGATAATGC 126 0 

40 TTATGAAGCA ATTTTAAGAA TGGAAAAACG TTACAAAGGA CCAATTGCAT TTGAGTATAC 132 0 

ACATATTAAT AACAATACCG AACGTGGTTG GTTAAAAAGA AGAATTGAAA CGCCATATAA 13 8 0 

AGTAACGTTA AATAATAACG AAAAAAGGGC ACTATTCAAA CAATTAGCGT ATGTTGAAGG 144 0 

GTTTGAAAAA TATCTTCATA AAAACTTCGT TGGTGCAAAG CGTTTTTCAA TTGAAGGGGT 15 0 0 

AGACGCACTT GTACCGATGT TACAACGTAC TATTACGATT GCTGCGAAAG AAGGTATTAA 156 0 

AAATATACAA ATAGGCATGG CTCACCGTGG ACGTTTAAAC GTTTTAACGC ATGTCTTAGA 16 2 0 

AAAACCGTAC GAAATGATGA TTTCAGAATT TATGCATACA GATCCAATGA AATTCTTACC 1680 

TGAAGATGGT AGCTTGCAGT TAACTGCTGG ATGGACTGGT GATGTGAAAT ATCACCTTGG 174 0 



45 
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AAGTCACTTG GAAATTGTTG CACCTGTTGT 
TGAGGGGCGT ACGAGAGCAG CACAAGATGA 1860

TACACAACGA GCTGGGGCTC CGACGACTGA TCATCATAAA GCAATGCCAA TTATTATACA 1920

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 1980

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2040

TACAGAACCA ATTGATGCAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100

TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 2160

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220

TCGTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 2280

TATTCGCAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAGCTTGTTA ATGAAGGTGT 2340

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAAAGGAAC TAAGACAAGC 2400

TCATGATAAA ATTAATAAAG CTGATAAAAT GGATAATCCA GATATGGAAA AGCCTGCAGA 2460

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2520

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2580

TCTTGAGAAG CGTCATGAGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGGCACAAGC 2640

AGAACAACTT GCATTTGCGA CAATTTTACA AGATGGTACA CCGATTCGCT TAACTGGTCA 2700

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 2760

TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820

TTCTCCGCTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2880

AAAAAGCTTC AATATTTGGG AAGCACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2940

TTTTGACAAC TTCTTATTCA GTTCTCGCTC AAAATGGGGA GAACGTTCAG GATTAACATT 3000

ATTCTTACCT CATGCATATG AGGGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCG 3060

ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTGAACAAA TGCGACCATT 3180

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3240

ATTTACTTCT GGTGGATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 3300

AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAGAAGCAT TAGCTAAAAA 3360

TCCAGACGAA TCAGTATTAC TCGTTGCGAT TGAAAGATTG TATCCATTCC CAGAGGAAGA 3420

GATTGAAGCA TTACTAGCAC AATTGCCAAA CCTTGAAGAA GTGTCATGGG TACAAGAAGA 3480

ACCTAAAAAT CAAGGTGCAT GGTTATATGT CTATCCATAT GTTAAAGTGC TAGTTGCAGA 3540
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AGAAATT CAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 3 66 0 

GGAAATAAGT CATGCCAGAG GTTAAAGTTC CAGAATTAGC AGAATCTATT ACAGAAGGTA 3 720 

5 CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCTTG 3 7 80 

AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTGA AGAAGCAGGT GTATTATCTG 3 84 0 

AACAACTTGC AAGTGAAGGC GACACTGTAG AAGTTGGACA rGCAATTGCT ATCATCGGCG 3 900 

10 

AAGGTAGTGG CAATGCTTCT AAAGAAAATA GTAACGACAA TACTCCACAA CAAAATGAAG 3 960 

AAACAAATAA TAAAAAAGAA GAAACAACAA ATAATTCGGT AGATAAAGCT GAAGTAAATC 4 020 

AAGCAAATGA TGACAATCAG CAACGTATTA ATGctACGCC TTCTGCGCGT CGATATGCTC 4080

GTGAAAATGG TGTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATGTG GTTCGTAAAG 414 0 

AAGATATTGA T AAG AAA CAA CAGGCACCGG CATCAACACA AACAACACAA CAAGCATCTG 4200 

20 CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAACC AGTGATTCGT GAAAAAATGT 4 260 

CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATGT 4 320 

TAACAACATT TAACGAAtTG AcATGACAAA TGTTATGGAA TTGCGTAAAC GTAAGAAAGA 4 380 

ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 4440

TTCTGTAGCA GCTTTGAAAA AGTATCCAGA AGTTAATGCA GAAATCGACG GCGACGACAT 4 500 

GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATG GATTATTAGT 4 56 0 

30 

AC CATTTG TA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTG cTAA 4 62 0 

TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT 4680 

35 TACGATTACA AATGGCGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 4 74 0 

TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTGCGA TTGATCAAGA 4800

TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 4 85 8 

(2) INFORMATION FOR SEQ ID NO: 380:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2222 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACATTTTTA AACAGTTCAT 6 0 

TCAATATTAT TTTTATCTAA TAATnATATT GGGAGGATTA TACCTTTATA CACACCATGC 120 
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5 






ATGCTATTTA GCTAAAGCTA AAAGACCAGA CACTATGCAT ATTTCAACTG GAAATATGTG 240

GCGATACTTA GTTGCAATTA TTGCCTGTAT GATTTGGTAC CTTAATAAAG CGCATGTAAG 300

TATCATCGGT ATAATTATTG GTTTAATGAT TTCATATGTT GTAGTTATCA TACGTCCTTT 360

ACTAAAGGTG AGCAAATAAA TTAAGAAAGA GGTGAGATTA TGGATCACAA ATCCCCGCTC 420

GTGAGTTGGA ATTTATTCGG TTTTGaTATC GTTTTCAATT TATCAAGTAT ATTGATGATA 480

CTTGTTACGG cGTTTCTTGT TTTTCTACTT GcTATCATTT GTACGCGTAA TTTGAAAAAA 540

AGACCAACTG GCAAACAAAA TTTCGTTGAA tggatttttg ATTTCGTGAG GGGAATCATT 600

GAAGGTAACA TGGCTTGGAA AAAAGGTGGT CAATTCCACT TCTTAGCAGT AACGCTGaTT 660

CTGTACATTT TTATAGCTAA TATGTTAGGT CTTCCGTTTT CTATAGTAAC GAAAGATCAC 720

ACATTGTGGT GGAAATCACC GACAGCnGAT GCAACAGTGA CTTTAACGTT GTCTACAACG 780

ATAATACTGT TAACTCACTT TTATGGAATT AAAATGCGTG GTACGAAACA ATATCTTAAA 840

GGTTATGTAC AGCCGTTTTG GCCATTGGCA ATTATTAATG TTTTTGAAGA GTTCACTTCA 900

ACATTAACGC TTGGTCTGCG TTTGTACGGT AACATATTTG CAGGTGAGAT ACTATTAACA 960

TTACTTGCTG GCTTATTCTT TAACGAACCA GCATGGGGTT GGATTATTAG TATCCCAGGA 1020

TTAATTGTTT GGCAAGCATT TTCAATATTT GTAGGAACAA TCCAAGCATA TATCTTTATT 1080

ATGCTTTCGA TGGTTTATAT GTCACATAAA GTGGCAGATG AACACTAAAA ATTTCAATAA 1140

TTATATACAA TCACAGGAGG AAATTAAATT ATGAATTTAA TCGCAGCAGC AATCGCAATT 1200

GGTTTATCAG CATTAGGAGC AGGTATCGGT AACGGTTTAA TCGTTTCAAG AACAGTTGAA 1260

GGTGTAGCAC GTCAACCAGA AGCACGTGGT CAATTAATGG GTATCATGTT CATTGGTGTA 1320

GGTTTAGTTG AGGCATTACC TATCATCGGT GTAGTAATTG CATTCATGAC ATTTGCTGGA 1380

TAATTAACAG ATAAAAGAGG TCGGGACAAA GCGCATAGGA CATAATTCAT GATGCATATA 1440

TAGTAATATC TTTGAACTTT ATTAAATAGT TGAGATATGA ACGCACCATG CCTATCGCAT 1500

AAATTCAGTA GGTCCTAACC TCGTCGTTTT TTTCTATATA ACACTAGCGA TTATTTTAAT 1560

GAAAGGAGTG TCATGAACCC GTGACTGAAA CAGCTAACTT ATTCGTTCTT GGTGCAGCTG 1620








AGGTCCTAAC 

JV-JVJ 1 1 AAV — 


TTTCATCGTG 


TTAcTTGCGT 


1680 


TACTTAAAAA GTTCGCATGG GGTCCATTGA AAGATGTAAT GGATAAACGT GAAAGAGATA 1740

TTAACAGAGA TATCGATGAC GCAGAACAAG CTAAGTTAAA TGCACAGAAA CTTGAAGAAG 1800

AAAATAAACA AAAACTTAAA GAAACACAAG AAGAAGTTCA AAAGATTTTA GAAGATGCTA 1860

AGGTTCAAGC ACGTCAACAG CAAGAACAAA TTATTCaTGA AGCAAACGTA CGTGCAAACG 1920
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TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC TAAAGTTCTT AGAAAAGAAA 



2040 



TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT AAAAGAGGCA GGCGATAAAT 2100

AATGGTAAAA GTAGCTAACA AGTATgcTAA AGCATTATTT GACGTGTCAT TAGATACAAA 2160

TAATTTAGAG ACTATTAATG AAGAATTAAC AGTTATAAAT GAAGCAGTAA AAGATAAAAT 2220

TG 2222



(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

TAACAATTCC TaTATTCATG TTTAATACGA AACACTACAT TTACATTGTA ATTCACTATC 60 

TTTTGAAGTA ATAAAGTGAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC GATTGTAACA 120 

ATT AT AT G CA AGTTTT CAGG ATTAACACCT AATCTTTGAA TGATTTGTTT AATAGTATAA 180 

TAATCCATCC AATAAAAATT ACTTCCTTTA ATATAAATGT TTTTAGGTTG ATACATTTGA 24 0 

CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA TACTTGAAGA ACTTTGATTA 3 00 

GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA CTAATTGGGA ATGGTTAATT 3 60 

GAATGAGATG TTGGACTATA TCTTTTTCTA ATTAATTTAG GAGTGTTACC ATACGTTTCT 4 20 

TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT GACTACTTGC GATTTCTTCA 4 80 

ATAGGCTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT CCAGTCTAAT ATGATTTAAA 54 0 

TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT CACTTAAAGC TTTGTTTGTA 6 00 

ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT TATGGTGGTT GTTTGTTAAA 660 

TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT CGTATGTATT TAATGACATT 720 

AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT TATAAGACTG TTGTTTAATT 780

TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC TAGCAACCTC ATTTGTTAAT 84 0 

ATCTTGCCAC G CAT CAT ATA GTTTGCATTC ATAAACTTCA AAAATGTCTT TGCACTTACT 900 

TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT GATACAAATC ATTCAACATA 960 

ATAATGGTGA CATCATTTCT TTTTACATCA TATTGTTGTA GATTGATTGA AACTACACCT 1020 

GTACCTTGTA G C CAAT AT AT AATTTT t AAG TTCGCTTCCT TCACACTACT CATTTTCATA 10 80 
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ATT 114 3 

(2) INFORMATION FOR SEQ ID NO: 382:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

GGACCACCTT TTTTACCTGT ACGACTAAAG CCAACTGTTT TAAAGGCAC C AACAGCATGC 6 0 

CACACTTGAA TAATTTCTTG TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA 12 0 

TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 18 0 

CATTTGCGTC TATCCGTAAT ATTCTCCTTA AATACCGTTT TAATATCATA ATCAAAATCT 24 0 

ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT TCCCTGATAA ATTCGGACGC 3 00 

GAATCTGATG TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 36 0 

25 AAAATAGCTT TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTGTTTaTGA 42 0 

AGCCAGTGTG cACTTgTCGC AATGaCCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 48 0 

TCAAATACAA ATTCGTTAAC GTCACT 50 6 
(2) INFORMATION FOR SEQ ID NO: 383:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

40 

AGCATCTGCA AtGAGTcTCT AATACAACGA TACGTTTTGC ATCTTTAGGT ACTTTTACTG 60 

TACCATTTTC ATCTTTTACC GAAATAGTAT CTTTAGTTGA TGATTCTTTT TTACTTGAAT 120 

45 TATCCGTATT ACCACAAGCT G C AACTAAAA GTAAGGCAAC TATTAATCCC AATATACTAA 180 

AAGTTTTTAG ACCTCTCATC GTTCCACTCC TTAATATGTA TAACTTCATT TATTATTTTA 24 0 

TTGATAACAA TTATCATTGT CAAGTAGCGT TCAATCTTTT TTATATTTCT AAAATGTATG 3 00 

50 ACTATATATT TCCTCTAATA ATTATGACTA CAATTAGCAC ATTTCCTTAG ACAAAATACT 360 

GATAATGTAT CATTGCTATA TCATCTTTGC ATTAATACAA TTGACACCAT TTAGCATGAC 42 0 

55 
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(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

TGCACTTGAT ACTTTAGCAC ATGAAATTGC TAATCGATTA GTTGCAAATG ACAAGAATGA 60 

AGCAACTTTG GAAATGACTA ATAAAATGGC AACGATTCGT TTTACAGAAC CTACGCTGAT 120 

TGCATTAGCA GGGGGTAATG TCAAAGCTTA CACTGAGCAT ATGACTATAT CTCCATATAA 180 

ATTGTATTTG TTAGATAAAG GCGATGTTTT AAAGTTTAGA GAAACAAGTT ATACATCGCG 24 0 

AGTGTATTTA GCTGTgGGAG GCGGATTTGA ATTAGATGCA TGGTTAGGAT CTAACTCAAC 300 

CGACTTTAAT GTAAAAATTG GTGGTTTTAA AGGTAGAACA TTACAAGATG GCGATGAAAT 360 

AAAGCTTAAG AGAGATTATA CAGCTCGTCA TCATAAGTTA TTTGAAAACC TTGCTCACAC 420 

GAAACAAACA GATTGGGGTA TTGATGGATA CGCCTTGTCA TTTAATTATA TGTCTGATGT 480 

ATTTCATGTC GTTAAAAATA AAGGTACGGA AGATTTTAAA GAAGATGCCA TTCAAAGATT 54 0 

TGTGAAACAT GATTATAAAG TAACGAGCAA AGCAAATCGC ATGGGGATGA TGCTTGAAGG 600 

TGAAAAAATC AAAGCTTTTT ATGAAGATAT GCCACCGTAT CAGACTGTCA AAAAAGGAAC 660 

GATACAAATT AAGCGTGATG GCACACCTAT T AT C CTATTA AATGATCATT ATACGCTAGG 72 0 

TAGCTACCCG CAAATCGGTA CAATCGCAAG TTATCATTTA ACGAAATTAG CACAAAAACC 7 80 

GCAAGGATCA CGTTTGAAAT TTCAATTTAT AGATATTTTA ACGGCTGAAA AGAACCTTGT 84 0 

TAAgTATAGT AACTGGTTAA AC 862 
(2) INFORMATION FOR SEQ ID NO: 385:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 
50 AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA SATAAACGGT 6 0 

CTTCTACCTG TCTsACATTA TCTTTCCAAC G ATTAAT AC C TAAGCGTTTA G AAA CA C G TT 12 0 


CAGTTTTACG TCCTACACCT GCTAAACTTT CTAATTCCTT ATGTGTTTGT GGTATTTCTC 24 0 

CATTAAATTG ATCAATCAAA GATTGACAAA GTTTCTTAAT ATTCTTAGCT TTGTTACGAT 300 

ACAGACCGAT AGAACGAATA TCATTCATAA GTTCTTCATC ACTGACTGCC AAATAATCTT 36 0 

CAGGCGTTTT GTATTTTTTA AACAGCTCAG TTGTTACTCT ATTTACTAGA ACGTCTGTAC 42 0 

ATTGCGCTGA CAATAATACA GCAATAGTTA ATTCGAACGG ATT AT CAT GT TTTAATTCAC 4 80 

ATT CTG CATC CGGAAACATA TTTGCTATAA CATCAATCAT TTCTAATGCT TTTTTCTTAC 54 0 

TTACCATCAA GGTTCTCCCC GTTTAACCAA TCAAATTTAG GTACCGTTTT AACTGTGTGC 600 

GTCATTTTCG GTTTATTGAA TTTTTCTCTT ATTTTTCTAG AATCGTCAAT TGTTTTGACA 660 

TTGTTTTTCT TCCAATTAAG TAAAATACGA TCTATATATT TAAAGCTAAG TTTATTCAAA 72 0 

CTATTCGCCT CGTCTAATGC CGCTTGTATA ATTGCAGTAT CGTGTTTATC AACATCAATC 78 0 

CATTGATTTA ACGTTTCTAT TTCATATGGA GATAACGGCC TTGCAAATGT ATC CTCTAAA 84 0 

ACTCTAAATA ATTG TTTAAA TTTTTCTTTA CTATTTAGCT CTTTCGTTTC CATACTTTGT 900 

TGCTTCAATA TATGACTTAA TTTTTCGAAA AAAGGATCTA G ATT CAT AT A TTCGGkAAAT 96 0 

CTACCTTCyT CATCyTTTTG aACTkGtAAT t CTAGCAATT CACGTgTATC AAATTTTGGA 102 0 
TACCATT 1027

(2) INFORMATION FOR SEQ ID NO: 386:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:

AAGGnTTGGA GGGAATTAAT nGATGGCAAA TCCCAGaTAA AGTACACaAC GCATGGAATG 6 0 

TGATGCATTT AATATTTCCA GTAGTTAGTA CGACGTTTGC AAGCTTTAAA TCTATGTATG 12 0 

GGGGCATACC AAAAGATTTC ATAGACTACT TATTTATTGA TGAAGCAGGA CAAGCAATAC 180

CTCAAGCAGC TGTGGGAGCA TTATATCGTT CAAAAAAAGT TGTAGCTGTA GGTGATCCGA 240

TTCAAATAGA ACCGGTTGTG ACTTTAGAAA GTCATTTAAT TGATAACATT CGTAAAAATT 3 00 

ATCATGTTCC GGAATATCTA GTTTCTAAAG AAGCTTCTGT GCAGTCTGTT GCAGACAACG 3 60 

CCAATCAATA TGGTTTTTGG AAATCTGATG CTACTGATAG TAATCAAAAA ACCTGGATAG 4 20 

GCATACCTTT ATGGGTGCAC AGACGATGTT TAAAACCTAT GTTCACGATa GCTAa CCAAa 4 80 
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GGTATGACGT TAAAGGAAaC GCAGTTCAAA AACAATTTGT GAAAGAGCAT GGTGAAAAaG 6 00 

TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAATTAA GGAAGGTAAA AATGAACCGA 6 60 

GCTCATTTGT AATATCGCCT TTTTCAGCAG TACAGCAACA GATTAAACGT ATGTTAAAGC 720 

AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GTCGATAAAT 7 80 

CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT TTTGTAATAG 84 0 

GTACTGATAA TACCCAAGAT GGTGCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 900 

ACGTTGCAGT GACAAGAGCT AAGAAAGAGT TTaTGTAATT GGCGACATGC AAAGAATACA 960 

GATGAaACCA TTTTATGAGA CGATTTTTAn AGnAAGAAAT GTAAAA 1006 
(2) INFORMATION FOR SEQ ID NO: 387:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387:
CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 6 0 

GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTTATATTA TCAGCAGATT 12 0 

30 

ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATGAGAGT ATGAAAGAAG 180 

CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 24 0 

CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC AGTTAACTTT GGAATTGTTT 3 00 

35 

ATGGGATAAG TGATTATGGT TTAAGTCAAA GTTTAGGTAT TACT CGT AAA AAAGCAAAAG 360 

CATTCATTGA TGATTATTTA GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 420 

40 TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTGCT ACATCGTCGA CGCTATATTC 480 

CTGATATTAC GAGTCGTAAC TTTAATTTAC GCGGCTTTGC TGAACGTACT GCTATGAATA 54 0 

CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 6 00 

45 AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATTT 6 60 

TT 662 
(2) INFORMATION FOR SEQ ID NO: 388:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 
5 TCATCCAAAT TTTGGAAATT CCACATTTTA CATATCGTAA TTTTTTAGGA AACTAGTGAA 60 

TATAACAAAT CCCTCCTCTC ATTTTTAAAA TAGATATATC ACTTCCCCAC TTTTACTTAA 12 0 

CTAAACTGCA ACGGTTCCTA ATACCAAAAT CCTGCCCTCT ATTTTTATCA ATTCAAGCAT 180 

10 ACTTATTGAA AAATGTTAAC GTTTTCTTGA TAATCATTGT AAGCGCATTT ATTTTATAAA 24 0 

CTAACGTTTG AAATATACTA CAGGAGTGAC ACGTAATGAC TCAAATTACT GAAAAAGAAT 3 00 

TAAAAAAGAA GTATTTAGAT TTACTATCCC AAAATTTTGA TACTCCAGAA AAACTTGCAA 360 

CTGAAATTAT CAATTTAGAA TCAATTTTAG AATTACCTAA AGGTACGGAA CATTTCGTCA 4 20 

GCGATTTACA TGGTGAATAT GAAGCTTTCC AACACGTATT ACGCAACGGT TCTGGGAACG 4 80 

TGCGAGCGAA AATCAATGAT ATTTTCAAAG AGAGACTTTC AACTAAGGAG CTTAATGACT 54 0 

TAACTGCTCT TGTCTACTAT CCAGAAGrCm AATTtAAAAT TgATTAAAAG TGATTTCCAA 6 00 

AATTgCGGtC mACtTAATGt CyGGtATATC ACaACr\ATCG aACATTTAAT TGAGTTAATT 660 

AAATATTGT 66 9 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389:

CACATGGCTG TTAGAGATAT GAATGGCCAT GCGTTACCTT TAACAAAAGA TGGCAATTTT 60 

TATCAAACGA ATGTAGATGC AAATGGTGTT AAT CATGGTG GTAGTGAAAT GGTG CAAAAT 120 

40 

AAAACAGGTC ATATGAGTCA ACAArGCCAT ATGAATCAGA ACACACATGA ACCAACAGCC 180 

ACACATGCAA CAAGGTCATA TGCAATCATC AAACCATCAA ATGATGAGTC CAAAAGCAAA 24 0 

TATGCATTCA TCAAATCATC AAATGAACCA AAGTAACAAA AAAGTTTTAC CAGCTGCTGG 3 00 

45 

TGAAAGTATG ACATCAAGTA TTCTTACTGC AAGTATTGCC GCACTACTAT TAGTATCTGG 360 

GTTATTCTTA GCATTTAGAC GACGTTCAAC AAATAAATAA A CAT AAT ACG ATTAATAATA 420 

50 GAAAAATCGT GTGATTATCT GaGGGAGCCT AGGACATAAA TCAATGTCCT AGGCCCnCTA 4 80 

AtGTTATATT GGCAGTAGTT GACTGAATGA AATTGCGCTT GTAACAAGCT TTTCCATTTC 54 0 

55 
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CGATTGTCTT ATACGTGTCA GTGTTAATTC AGATATTTCC TGTGGAATAT ACCACTTATT 660 

AATCATAATT GGATAAGGTG tTTGTGCGTA CAGTGTTTCA ATAATCAGCC AACAATGTGT 7 20 

5 ATCACCaTCA AACACGTGAC TATGATTTTk GAAGTGGGGC GCTTTGGTAA TAGACATTTT 7 80 

TAAATCTGAT TGATATGCAT TGCTATAAAT CGTTTGCTCA ACGAATGTCT TOATGTCGTC 84 0 

TTCGTTTTGT GTATTCACTT TAAATGTGTC AATGACATTT AACGGTATAA AGGTAAAGCA 900 

10 

AAATGCATCA GCTTGCTTAG AATGATTGTC CTTTTTTTGA TAATAGCGTT CCATTGCAAT 960 

GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA AATCAACACG 1020 

ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 10 8 0 

15 

TTG AT C CAT A CTTGAAGCGT GTTGGGCATT TGTTTGTGGA ATAACAAAGT GTCCCTTCCC 114 0 

TCTTGTACTC TCTACGATGC CATCTTCGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 12 00 

ACGACTGGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 1249

(2) INFORMATION FOR SEQ ID NO: 390:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 



AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 60

GATTGTAGGT TATGTCATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120

ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180

AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 240

AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 300

TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATGACAGTAA 360

CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA ATAAAATCAT 420

ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 480

TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 540

TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 600

ACAGTTaAGa AAAAGATATG ATATTCCAAC CAAACAAGCG CCGAAACTAT TATTGAAAGG 660

CGATGGAGAT TTAAAAGGTT CATCCGTAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 720
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AGGTATGAGT CAAACTGAAT ATCAAATAAA ACCTGGTAAT ATAACAAGTA ACTCTGAAGA 84 0 

AACAAGTTCG ATATCTAAAG TGAGCTGTGA AATATAGGTA GCCATTTCAA AAAATTTAAA 900 

5 GGTGAATTTG ATAATGTAGC TCAAGGAGAT TGGGTTAAAA AGGCGAAGAA TGAACTGGAT 960 

GATATTAGTA AGAAATTAAA AAATATTCAA AGAACGGAAG TTTAATAGCT TATATGATTC 102 0 

TTGGAGCTAA GACAGCATGC GTTCATTCAT GCCATTATTA ATATAAGCAC CG CAACAAAA 10 80 

10 

AAGCTTCTAA TGTGATACAG GAAC CTCATA TTCCGTATCA TGTTAGAAGC TTTTAATGTC 114 0 

TAAAGAACAT CTACATTTTA TCATATTTTC TGACTTATTA AACTTTTATA TAATTAAATA 12 00 

TTTCTTAATT TTCCAAAATA GTGATAAATT TGTGAAATAC ATCACAAATC CCTTTATTTA 126 0 

15 

TTTGGAAATT CATGTAATAT TAGACTTGTA AGAAGTTAAT AAATAGAGAG AGACGAGAGA 1320 

GTTTATATAA ATACTATATA AACATTGGAG TGATGATTAT GAGAAAAGAG ATTGAAGCGC 13 80 

TTATTTTCTC AG ACGTAA t A GCTATGATAT TTACGTGAAC ACTGGTGTAA AT CAAGG ATT 144 0 

20 

AATTGGTGAC ATCAAAGATG GTTACCTAAC TATTGATTCT ATGCCTTACA TTGATGCTGA 1500 

GCGTTTGTAT CACTTTGCTA TGGAACGTAA ATCGTTAGTC ACTAACTAGT TCTTATTGCC 156 0 

25 AATGATTACT ACCCCTAGTC GGCGGCAATT GAAGTGTGAT TGATGTAACT TGCCCTCGTT 162 0 

GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT TAAGTAAACC CTAACTCCCC ACAAATCTGG 168 0 

AACGATACTA AAAGCCACGT CCTATATTGG ATGTGGCTTT AGTCAkACTT ATATTATTTT 174 0 

30 tAAAACGATT ACCTACAAGA TTTACATATA AAATTCTATC ATGnCTGC 178 8 

(2) INFORMATION FOR SEQ ID NO: 391:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2407 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:

GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 60 

GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TATCAGGAAT ATTATATAAA 12 0 

4b 

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAGCATTTTT 180 

ATGTTATTTA GATTGCATGT AGATAAACCA ACTCATTCAG AGGAAGAATC AACAAATAGC 24 0 

50 TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 300 

TTTGGAGTTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 36 0 

55 
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TATGGCATTG GAGGATTATG TGCAGGTTTA ATTTCAGCTA TTCTTTCGAA GAAAATCTCA 480 

ACTAAAGTAT TGATATTTTT GTTATATTTC ATATTAGTCA TAAATTCAGC ACTATTTATT 54 0 

5 TGGATAAACT CAGCATTTTA CTTATTCATA GGATCATTTA TACTAGGCTA CTCAATTTCA 600 

TCAATCAGAA TTTATATGAA TACAGCTATA ATGAACACTG TTTCAGATAA ATATGTCGGT 66 0 

CGCTCATTTA CGATATGGAC GTCAATTTCA TTGTTACTGC AAAGTTTAAT TGCTCCATnT 72 0 

W 

TTAGGAAGAT GGATTAATGA AATTAATGAT AAATTCGGTT TCTATATTAT ACTCATTTTA 780 

TCCTTACTCA TATTTGTaCA CTGCTGCTTG TTAACAAAAC AGACAAAATA AAATATGCAC 840

ATAAAGAAGA GTGACCGTCA CTCTTCTTTA ACAAGCGACC ATTTATCGaT GGGCTTAGTT 900 

15 

CTCTCTGCAC CCACACTTCA CTACTTCACT TTTTCAAATC ATTTTTTATG GTCTTAAATA 96 0 

AATCAGTGAG ATTTGTTGCT TCGGTAAAGT CTAGAATTAA TATCATTTCT TTAGAACCTG 102 0 

GATATGGCGA AACTAATGTA TTATCTTGCA ATTTCTGCTG GGCACTTTTA GTCGCCTTGA 1080 

CCAATAATCT ATTATCATAC AAACCACCTA TAACCACGCC ATCATAATAA ATAATATATT 114 0 

CTCCCATCAT CTTTCTTGTC TTAACCGCGT TTGAATTCAC ATGATTTAAA AATAAATCAT 1200 

25 GTACATCTTT CTTAGTCGCC ATTGTAATCG CTCCTTCAGT TTTATGTTTA ATCACATTCG 126 0 

TATTAATGAT TCATTTCGTG TTGCTCTTAA TTTTATCTAT AATTATATGC GTAGTTAAAA 132 0 

TCAAACCTAT GGAAAAGAAA ATAATGATAA CAATGTTAAG AAATATAGTT ATAAAATTAT 13 8 0 

30 AGTTTGGAAG GTATGCGAAT AG CAGAAAAA TAGGTATCGC AAAAAATAAA TCCCACCAAC 144 0 

CTAAACTTTT TAAAGAATGC TTTAAACCTT CCATAATATC ACCTTTATAA ATTTG T CTTT 1500 

GTTATAAGAT AACTAAAAAA TCGCTTTACT GTAAAAGTAG CCAAAGAAAA TTCTGAATCA 156 0 

35 

TATTCATAAG TAGTGTATCA TTAATAATGA ACAATTTAAT ACTATAATCC TTGATCTTTG 162 0 

TATTGATCAA CTTACCACAA CATTTATTTT AGACTACTCT TAGACTTCCC TTTCAAATGG 16 8 0 

TTGCATCTAT TGAAATTCCT TTTGTATAAG TTAGGCTTTT GTGGTAATAT CATCATGCAT 174 0 

40 

AAAAAATCGA GATACTAATT ATAAAGAGGG TATAAATATA TTATGAAAGA AAATTTTTGG 180 0 

AGTGAATTAC CACGTCCATT TTTTATTTTG GCGCCAATGG AAGACGTTAC AGATATCGTC 1860 

TTTCGACACG TTGTAAGTGA AGCAGCTAGA CCGGATGTGT TTTTCACTGA ATTTACAAAT 1920 

4b 

ACTGAAAGCT TTTGCCACCC TGAAGGCATA CATAGTGTGC GCGGACGCTT AACTTTTAGT 1980 

GAAGATGAAC AGCCGATGGT CG CT CAT ATA TGGGGAGATA AGCCAGAACA GTTCCGTGAA 2 04 0 

50 ACGAGTATTC AATTAGCTAA AATGGGCTTT AAAGGCATAG ACTTAAATAT GGGATGTCCT 2100 

GTAGCAAATG TTGCTAAAAA GGGTAAGGGT TCCGGCTTAA TCTTAAGACC TGACGTTGCT 2160 

55 
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GGCTACTATG AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCGA AOAAGACATT 



2280 



GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 2340

GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTGTTAACA 2400

ATTAACG 2407



(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392:

ATGATGnATG GCnCCGCCAA GAAGTTGAAC CAGTCTATTG GCTAGAATTG CTCAAAAAGC 6 0 

GAGAcaTGTG GTATTCaTAT GTTaGTAgCt ACGCAAAGAC C AT CTGTCAA TGTAATTACA 120 

GGTTTAATTA AAGCCAACAT ACCAACAAGA ATTGCATTTA TGGTATCATC aAGTGTAGAT 180 

TCGAGAACGA TATTAGACAG TGGTGGAGCA GAACGCTTGT TAGGATATGG CGATATGTTA 24 0 

TATCTTGGTA GCGGT ATGAA TAAACCGATT AGAGTTCAAG GTACATTTGT TTCTGATGAC 300 

GAAATTGATG ATG TTGTTG A TTTTATCAAA CAACAAAGAG AACCGGACTA TCTATTTGAA 360 

GAAAAAGAAT TGTTGAAAAA AACACAAACA CAATCACAAG ATGaATTATT TGATGaTGTT 420 

TGTGCATTTA TGGTTAATGA AGGACATATT TCAA CATCAT TAATC CAAAG ACATTTCCAA 4 80 

ATTGGCTATA ATAGAGCAGC AAGAATTATC GATCAATTAG AGCAACTCGG TTATGTTTCG 54 0 

AGTGCTAATG GTTCAAAACC AAGGGATGTT TATGTTACGG AAGCAGATTT AAATAAAGAA 600 

TAATTATGAG TAAGGAGTTT TATATAATGA CACACTATCA TTTTGTCGGA ATTAAAGGTT 660 

CTGGCATGAG TTCATTAGCA CAAATCATGC ATGATTTAGG ACATGAAGTT CAAGGATCGG 72 0 

ATATTGAGAA CTACGTATTT ACAGAAGTTG CTCTTAGAAA TAAGGGGATA AAAATATTAC 78 0 

CATTTGATGC TAATAACATA AAAGAAGATA TGGTAGTtAT ACAAGGTAAT GCATTCGCGA 84 0 

GTAGCCaTGA AGAAAtAGTA CGTGCACATC AATTGaAATT AGATGTTGTA AGTTATAATG 90 0 

ATTTTTTAGG ACAGATTATT GATCAATATA CTTCAGTAGC TGTAACTGGT G C ACATGGT A 96 0 

AAACTTCTAC AACAGGTTTA TTATCACATG TTATGAATGG TGATAAAAAG AC TT CATTTT 102 0 

TAATTGGTGA TGGCACAGGT ATGGGATTGC CTGAAAGTGA TTATTTCGCT TTTGAGGCAT 10 80 

GTGAATATAG ACGTCACTTT TTAAGTTATA AACCTGATTA CGCAATTATG ACAAATATTG 114 0 
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TGGCACATAA TGTTAAAAAA GGTATTATTG CTTGGGGTGA TGATGAACAT CTACGTAAAA 126 0 

TTGAAGCAGA TGTTCCAATT TATTATTATG GATTTAAAGA TTCGGATGAC ATTTATGCTC 13 2 0 

5 AAAATATTCA AATTACGGAT AAAGGTACTG CTTTTGATGT GTATGTGGAT GGTGAGTTTT 13 80 

ATGATCACTT CCTGTCTCCA CAATATGGTG ACCATACAGT TTTAAATGCA TTAGCTGTAA 144 0 

TTGCGATTAG TTATTTAGAG AAGCTAGATG TTACAAATAT TAAAGAAGCA TTAGAAACGT 1500 

10 

TTGGTGGTGT TAAACGTCGT TTCAATGAAA CTACAATTGC AAATCAAGTT ATTGTAGATG 156 0 

ATTATGCACA CCATCCAAGA GAAATTAGTG CTACAATTGA AACAGCACGA AAGAAATATC 1620 

CACATAAAGA AGTTGTTGCA GTATTTCAAC CACACACTTT CTCTAGAACA CAGGCATTTT 1680 


TAAATGAATT TGCAGAAAGT TTAAGTAAAG CAGATCGTGT ATTCTTATGT GAAATTTTTG 174 0 

GATCAATTAG AGAAAATACT GGCGCATTAA CGATACAAGA TTTAATTGAT AAAATTGAAG 1800 

20 GTGCATCGTT AATTAATGAA GATTCTATTA ATGTATTAGA ACAATTTGAT AATGCTGTTA 18 60 

TTTTATTTAT GGGTGCAGGT GATATTCAAA AATTACAAAA TG CAT AT TT A GATAAATTAG 1920 

GCATGAAAAA TGCGTTTTAA TATGTTTATA ATAGAGTAGT ATGGGTATTT ATTATTAATG 1980 

25 ACATTATTAC ATGTTAATTA GGAGGCGTTT TTAATGGATT GGATTTTACC AATTGCTGGA 204 0 

ATTATCGCTG CGATTGCATT C TT AATTTT A TGTATCGGTA TCGTAGCTGT ATTAAATT CT 2100 

GTTAAGaAAA ACTTAGATTA TGTTGCAAAA ACACTTGACG GTGTAGAAGG TCAAGTTCAA 2160 


GGTATTACTC GTGAAACAAC AGATTTACTT CAT AAAG T AA ACCGTTTAAC TGAGGATATC 2 220 

CAAGGTAAAG TAGATCGTTT AAA CT CAGTT GTAGATGCTG TTAAAGGTAT CGGTGACTCA 2280 

GTACAAACGT TAAACAGCTC TGTAGATCGT GTAACAAATT CAATTACACA TAATATTTCT 2 34 0 

35 

CAAAATGAAG ATAAAATCTC ACAAGTTGTT CAATGGTCAA ATGTTGCAAT GGAAATTGCA 24 0 0 

GACAAATGGC AAAATAGACA CTAC 24 24 
(2) INFORMATION FOR SEQ ID NO: 393:

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
CnATAATATT CnTCAAnCCT TTAAATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 6 0 

GTGGGTAGTT TTTTT AG TTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGGCAGA 12 0 
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AG7GTAAAGT TTTCAACATA ATACTATTAG TTCGGTCATG TATCGGACTG ATGGAAAAGC 240 

GTTTCACTTT TAATGACTCA TTAAGAACGG CCTGAAAATG TTTGGCGTAT TAAGTGCAAT 3 00 

5 

GATAGTTTTG ACATTTAGTT TCTAATTGGT CATTACTGCC GAGCAAATCT AGTAGAGTAA 3 60 

TCATGTAAAT CTTTAATGTG CCATTTGATT CACTAGCGGT GTTAATAACT ACGGAAATTG 42 0 

CATTTCCGAC TGAAATTTTT GAAAAATATC AACGTACGCT ACAAATAAAA TTTTAAACTG 48 0 

10 

TTATAAATGT GTCTCAATTT CATATGTTCA TCGACGATAT GAAGCGTATT ATGGTAAAAT 54 0 

GAAGAAATAA TAAACTTGTT AATAAATAAA ACATCACGAT TTGACTAAAG CACTTTATTA 60 0 

TTGTGTAGAT AATAGTTTTT TAACGAAATA AAAATGGCGA CTGGTTTTAA TAAATCAGCT 660

AATGAATCAC TACACCTATA AGTATGAATA TAGTGATTAG AATGCTTTGT ATAGTTGGAT 720 

TTTGCAAAAT TGATGTTA 73 8 

(2) INFORMATION FOR SEQ ID NO: 394:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:

AAAAGTTGTA ATTAAAAGTG GGATTTTACT TAAGnnAGAA GGAAACTATT TATATGACTA 60

ATAAAAGAGn AGATGTCCGC AATATAGCAA TTATTGCTCA CGTTGACCAT GGTAAAACAA 120

CTTTAGTAGA TGAGTTGTTA AAACAATCTG GTATATTCAG AGAAAATGAA CATGTCGATG 180

AACGTGCAAT GGACTCTAAC GATATCGAAA GAGAGCGTGG AATTACGATT CTAGCCAAAA 240

ATACGGCTGT TGATTATAAA GGTACACGTA TTAATATTTT GGATACACCA GGACATGCAG 300

ACTTTGGTGG AGAAGTAGAA CGTATTATGA AAATGGTTGA TGGGGTTGTC TTAGTAGTAG 360

ATGCGTATGA AGGTACAATG CCTCAAACAC GTTTTGTACT TAAAAAAGCG CTAGAACAAA 420

ACCTGAAACC TGTTGTTGTT GTTAATAAAA TTGATAAACC ATCAGCACGT CCAGAGGGTG 480

TTGTAGATGA AGTTTTAGAT TTATTTATTG AATTAGAAGC AAACGriTGAA CAATTAGAAT 540

TCCCTGTTGT TTATGCTTCA GCAGTAAATG GTACAGCTAG CTTAGATCCT GAAAAGCAAG 600

ATGATAATTT ACAATCATTA TATGAAACAA TTATTGATTC ATGTACCAGC TCCAATTGAT 660

AACAGTGATG AGCCCATTAC AATTTCCAAG TAGCATTGTT GGACTACAAT GATTATGTTG 720

GACGTATTGG TATTGGTCGT GTATTCAGAG GTAAAATGCG TGTCGGAGAT AATGTATCAC 780
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GATTAAAACG TTTAGAAATT GAAGAAGCAC AAGCTGGAGA TTTAATTGCT GTTTCAGGTA 900 

TGGAAGACAT TAATGTTGGT GAAaCTGTAA CACCACATGA CCATCAAGAA GCATTGCCAG 96 0 

5 TTCTACGTAT TGATGAGCCT ACTCTTGAAA TGACATTTAA AGTTAACAAT TCTCCATTTG 102 0 

CTGGCCGTGA AGGTGACTTT GTAACAGCAC GTCAAATTCA AGAACGTTTA AATCAACAAT 1080 

TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TGGGTAGTTG 114 0 

10 

CTGGTCG CGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 12 0 0 

AATTACAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATGGTGTA ATGTGTGAAC 12 6 0 

CATTTGAACG 12 7 0 


(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395:

AAGACCAGGA GAACAmGTAA AACAATATAC AGTTGaAATC GCTCGTAAAT TAATGGAATT 6 0 

TGATATAAAA TGCTCGTGAT TGCTTGTAAA TACGCAACTG CTGTnGCTTT AGAATATTTA 12 0 

30 CAAAAGACCT TATCAATCCC CAGTGATTGG CGTAATTGAc CAGGTGCTAG AaCAGCAATA 180 

ATGACTACTA GAAATCAAAA TGTATTAGTA CTAGGAACGG AAGGCACAAT TAAATCTGAA 24 0 

GCATATCGTA CGCATATTAA ACGTATCAAT CCACATGTAG AGGTACATGg CGTTGCCTGT 300 

35 

CCAGGTTTTG TGCCACTTGT AGAACAAATG AGATATAGTG ATCCAACAAT TACAAGCATT 360 

GTCATTCATC AAACACTGAA ACGTTGGCGT AATAGTGAGT CTGATACTGT CATTTTAGGA 42 0 

TGTACCCACT ATCCATTGCT CTATAAACCT AT CTATGATT ATTTTGGTGG TAAAAAGACA 4 80 

40 

GTGATTTCGT CTGGATTAGA AACGGCTCGT GAAGTTAGTG CATTGCTAAC ATTTAGTAAT 54 0 

GAACATGCAA GTTATACTGA ACATCCAGAT CATCGATTTT TTGCAACAGG TGATCCTACT 6 00 

CACATTACTA ACATTATCAA AGAGTGGTTA AATTTATCTG TCAATGTGGA ACGTATATCA 66 0 

45 

GTGAATGACT AGGAGGATTT TTAATGAAAG AGATT GTTAT TGCATCGAAT AATCAAGGGA 72 0 

AAATAAATGA CTTTAAAGTA ATATTTCCAG ATTAC CACGT AATAGGTATT TCAGAACTAA 780 

TACCAGATTT TGATGTGGAA GAAACAGGAT CAACATTTGA AGAAAATGCT ATATTAAAAT 840

CAGAAGCTGC TGCAAAAGCA TTGAATAAAA CGGTCATAGC TGATGACAGT GGACTAGAAG 90 0 

55 
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20 






EP 0 786 519 A2



GCGATGAAGC AAATATTGAA AAATTATTAA ATAAGCTTGG TAATACAACT GATCGTCGTG 102 0 

CGCAATTtGT TTGTGTCATA AGTATGAGTG GCCCTGATAT GG AAACAAAA GTATTTAAAG 108 0 

5 GTACTGTTTC AGGTGAAATT GCAGATGGAA AATATGGCGA AAATGGTTTC GGATATGATC 1140 

CGATATTTTA TGTACCGAAA TTAGATAAAA CCATGGCTCA ACTTTCAAAA GAACAAAAAG 1200 
GGCAAATTAG TCATAGACGA AATGCGATTA ATTTACTTCA AGCTTTTCTT GAAGGTGATA 1260 

10 

AAAATGTCTA AATGGATTAT TGTGAGTGAT AACCATACTG AATCAGGCGT TTTATATCAA 132 0 

ATTTATGAAA TGCACCCAGA TGCAGATGTA TATTTACATT TAGGA 1365 
(2) INFORMATION FOR SEQ ID NO: 396:

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396:

AATTCCTGGT GCAATAATAA ATAGGATGAA AAAGATTCGG AAAATATGAT AACTCGTAAT 60

CATAGCAACA TCGGCACCAG TAGCTAATGC AACTAAAACT ATCTGATTAA CCCCTCCTGG 120

TGCTGCACCA AGaAACAATT CATTAATAGG ATTATTATCA AAGAAATGTA TGATATAAAC 180

CATGATTAGC GCACCAATTA TCAACATAAT ATTTTGAATT GTAATTGCGA TTGCTAGTCT 240

ACCTTTTAAA TCTGACAATA AATGCGCAAT TTGAACTCCA ATTCTAATCA TATATATTAG 300

TTGTGCCATG TTCAACAACC AATGATCTAG TGTAAATGTT AAACCTGTAG AAAAATTCCA 360

AACAATTAAT ACAATGAGTG GTGCTAATAA TTGAAATGTT GGAAACTTTA TTTTAGACAT 420

AATTAGATAA ACTATAAAGA TAGCTATCGC TAAAATAACT ATTTGCCCTA TGTTTAATAC 480

TTGTGATAAA GGCAAGACTT TTGTTAACTT TCCATTCGCA TGCATGTTAC CATCATGAAA 540

AAAATATGAA ATGAACGGTA CTAAAACAAC AACAAATATA ATTCGTGATG TTTGCGTTAA 600

GCTAACAACT AACAAATTAG CACGTTTGTC TTGTTCAGCC ATGACCAGCA TTTGTGTTAG 660

TGCTCCTGGT ATAACACTTA AAATAGCTGT TTCTGTATTA ATACGTGCAA TTTTTTTAAA 720

AACAAATGCC ATTACTATTG CAATTAATAA TATCGAAATA GATACAACAA TAATCGAAAG 780

CCAATTGTTT TTAATATCCA TAACGACATT TTTCGTAAAC GTTGATCCGA TTTGCACACC 840

TAATAGTACA ATACCTAATT CACTAAGTAA GAATGGCCAT TTAATATCAA GTTTGAAAAC 900

TTTTACACAA ATGATTGATG CGATAATAGG ACCAAACATA AATGGAAGTA ATACGTGCGA 960
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TATCATTGCC ATGTTTTCCA CTTCTTTCAA TAAAAAATAA AATGACTAAA TTGCTGCTTG 108 0 

AGCTTCACGT TTGTTAAGAT AACAATATCC GCTAGCAGTT tTGACTACAA AGCATATATG 1140 
5 GaCTTTCACT ATCAAGTCGC CGCCCATGCC TTATATACAT TTAAAAnGAG CCTGAACAAA 120 0 

GTTCAGGCTC TCAATTTGTC CGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 126 0 

TGCTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTGTGCCG TAGATAAGAA TGTAC CTTCA 132 0 

10 

AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TGCAATAATT 13 80 

TCC 1383 
(2) INFORMATION FOR SEQ ID NO: 397: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

25 TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACAAT CTTAACGGGT 6 0 

ATGATTAAGA ACCACGGTTT AATTa TTTAG TGAGATTCTT ATTTGG TGT A GGTGAGGCGC 120 

CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTAAAAAT GAAAAAGGTA 18 0 

30 GAG CATCAAG TGCATTATTA GCAGGATCAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 24 0 

CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTGCAGTAG 3 00 

GTATTTTAAT GGCkGTATtA TGGGCGATTA TTGCCAAAGA CTTACCTGaG CrACATAGwa 360 

35 TGGTTAATGA AGCGGAGAAA CGTTTCATTA TGGAAAATCG TGATATCGTA GCTAC 415 



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 
TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TTcGCCTTTT 6 0 

aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG ACCAGAATAA 120 


AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGTTCTAC TTTTTCATCA 180 
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w 



40 



50 



AATGTGGGGT CGTATAAGAG CTTGTAATGC TTTTTGGCGT AATTGATAAC TGCTTTAAAA 3 00 

CCTGCCCACA CAGATCGTGC ATAGTCAACA GAAAGACCTG CATCGTTTAA CAAATAATTC 36 0 

CTGAAAGCAG TACATTGCGT AGTAGTGATT TTGCCAATAG GGATATTTCC GAACCTTTCT 4 20 

TTTATGTGAG TATTATATTC TGTAGTTCGC TTTTCTATTG AGCGTGCAGA AAGATTTTCA 4 80 

TTTTTTAAAC GATCAAAAAA TATATATTCA AAGGGTTGAT TGTCCGAGTA TCCATATTTA 54 0 

ACATTTTGTA TAAATTCGCT TTCAGCTAGT TTGGCATCTT TCTTACGTTC AAACCCACGC 6 00 

TTCATTTTTC GTTTGTTATT ACCGTATACA TCTTTATATC TAATGGAAAA ATACCATTTA 66 0 

CCTGTATTAT CATCCTTATA TACTGGCATT TTGCTTCTCC CTCCTCAAAA TTGGCAAAAA 72 0 

ATAATAAGGG TAGGCGGGCT ACCCGAAATT TAGTACTAGG TACTAAATGT GATATAATAA 78 0 

AATAAAAAGT AGGTGATGTT ATGACATTTA AAAACAATCA TAATTTCAAT GAATTAGTTT 84 0 

TAACGAATGA AGACATTAGA ATTTTAAAAA ATGTCTTAGA AGATGCAGTC AGTGTTTATG 90 0 

ATGAATATTC GGTATGTAAT GAAGAATCCG ATTTTGCTTA CTGTTTATTA AGAGACTTAT 96 0 

ATACATTAGA CAGCTTAGCT ATTTCGTCAA ATAATGTTTG AATTATCGAA TTGTACTCTT 102 0 

CGATTTTAAT ACCATGCATA ATAGAGTTTC TGTGTTCAAT AGCAGCTTTG ACTGAATGTk 1080

TTAAATGTTC TTCTATTAAA TCGTTGTTTk CCAtTTCGtk TAaAAATGTt CyTATATTCC 114 0 

T 1141 
(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 
ACTGTGGTAG GTTTTTTATT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60 

CTAATAACAC ATATTTAACT TGTATTCTTA AACTGGTATA ATAAATTTAT GTTGAAATGA 120 

ATATTGTATG ACAGGGTATT CACTTTTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 180 

AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAACGAATAA AGAAATTAAA 24 0 

ACAGGACGCT TTATTGCAAC TGCATCAATC GTATTCTCAA TATTATTGAT TATTCATTAC 3 00 

TTTGTTTCGT TGGATAATGC GACTGCCAAA GCATTACTTA ATTTAACGAA TCAAAACACT 3 60 

TCAGATAAAG CGATTGATTA CATTTTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 420 
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ATGTTTGCAG TTTATGTATC AAATAGTTTG TTTACGTTGA TTAATTTATC AATCACAATT 540 

CAAGCAATAA AAGCTGCACA CGGTGCGTAC TTAACATTGC CAATTTTAAT TGTTATTATA 6 00 

GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 660 

AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 7 06 
(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

20 ACACAATCTG AAGATTCACG TTGTGGTGCT GGACATGATC GAAAAATTAG AGCTGAACAA 6 0 

ATGAnAGAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 120 

ATAGGTGGCG ACCTTAATGT CAATAAAGGC ACTCCAGAGT TCAAAGATAT GCTTAArAAC 180 

TTGAATGTAA ATGATGTTCT ATATG CAGGT CATAAT AG CA CATGGGACCC TCAATCAAAT 24 0 

TCAATTGCGA AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 3 00 

GATAAAGATC ATAAACAACC AAAACAATTA GTCAATGAAG TTGTGACTGA AAAACCTAAG 3 60 

CCATGGGATG T ATATG CGTT CCCATATTAy aCGTTTACAA TGATTTTTCA GATCATTACC 4 20 

CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 4 80 

ACGAAGCGAG TTATATTGTT AAAATTTGAA TTGACTTACA TTTTAATAAA ATCATCTTAA 540

CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 6 00 

GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTACCTACT ACATTCTATG 660 

40 TAGCAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 720 

ACATTATGTG TCTTCCAATC AACTTCATAT AATG CTGATA ATTTTTCTTC TTTTTTATCT 7 80 

ACATGGTTTT CACCAGACCA ATAGCCCCAG AAACCATGGC GATTCCAATC TATTTTAAAC 840 

45 

TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 9 00 

TGTGACATAA CAGCTAAAAA TTCTGGATTA AACCCTTCAG ACACAGTTAC AGG CATTTTG 960 

TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 1020 

50 

TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCGTATGGTC ATGTCCCATA 1080 

TTATTTAT CA AATGTGCTTC TACTTTCCAA CCTACACCTT TATGTGACGT AGATTGATCA 114 0 

55 
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(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 

CCAGAATTAT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA 60 

CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaGCTG AAGATCAACA GTATTATAAA 120

ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA 180 

AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTTTTTATT 24 0 

20 GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT 3 00 

ACAGACTTAA CTCAAATTTT AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG 36 0 

AAACTGAATA CAATGAAATA TATTAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC 420 

25 

GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAAAAT GATTTTTAAT ATTCCGATTC 4 80 

GCACTATTGA GTATAAATAT GGACAAGCGA TTTATGGAGT CGGAGCAGGT ATTACAATTG 54 0 

ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT 600 

30 

TATAATG CAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA 66 0 

TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC 72 0 

35 ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGTGG T CAAT ATAGACTTAA 78 0 

AATCGTATTA AATG CTG AAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG 84 0 

TAGTTTT 84 7 

(2) INFORMATION FOR SEQ ID NO: 402:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

50 

TGAAGATGAA GCAGAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 6 0 

GTATCGTCGA TTACTTGCTT CTAACTTAAC AAGCGTTCCT GAAAGtTTAT CAT CATG AAA 120 


GTTAAACAGT TAAAAGATAA AGTGTCTAAA ATTGTGATTC AAATGAATAC ATTTGAAGAT 24 0 

GAAGCAAATG ATGTTCTTGT TAATGCTGTT TATGCAGAGA AATTAATTCA ATATGGAAAT 300 

AGATATCGTA AGGACTATAG CAATGTTGAT AAGAGCTTAA ATGAAGCTGA ACGATTATTT 36 0 

AAAAATAATC GCTATAAGCg TGCGATTGAA ATTGCAGAGC AAGCTCTTGA AAGTGTTGAG 420 

CCAGGTGTTA CTAAACATAT TGAAGAAGAA GTTATTAAGC AATAGAAACT AGTATGTAGT 480 

TATACTTAAA TAATATGAGC ACTCTGTCAA ATTGGACTGA TGAGTTTAAT AATTGAAGTT 54 0 

AGCCAACGAT ACGTTGTCTA GCTTCTTTTT TATATGGATA AATGaAAGGG ACAATAAATA 60 0 

TAAATAGCAA TTGTTTAAAG ATAAACGTAA TCAAATGTGT TGTTTTAATT AATATAAGTA 660 

GTGAAAAAAG CATAATCACA CAGCTGTTTA AATAGAGTGA AATAGTCTAA TTCTTATTTA 72 0 

ATAAGTAGAA ATAAGATTAT 74 0 
(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

ATGCCAATAA TTAAACCATG TAATAAATAT AC G TAT AG C G TACGACTACC AATATAAGTA 6 0 

TATAATTTTT TCTTTGTTGA CATTAAATTT AGAAACGCAG TCATTGCGAT TAATATAATT 120 

CCATATAATA TAAGTCGTTT AAAAGGACTG AATATACTCT GTCCTTCATT TTCAAGTGAA 180 

GTATATGGTG AACTTCCCAA TAACCAATCT GCATTGATAG GATGAATCAC GTAAACGATA 24 0 

AAAAACAAAA TAAAGGTAAT GATAGATACT GGTATTAGTT TTTTATTTTT AAAAATAGCC 30 0 

40 GTATGTTTTT TGGTGAAAAT GTAACCTAGA TAAAATATTG GGAAAAATAC GATTGTCCTT 360 

GAAATGCTTA AGTAGCTATC GATGTTATCT GAAAAACCTG CTCCAATAGA TATAATAATT 42 0 

GAAACTGATA GCACTTTATA TGGATTAAAT CTTCTAACTA TTACTAAAAT GACATGAAAG 480 

AAAAATAGCG TGATCAAAAA CCATAACGCA AATACTGGGT TAAAAGGATC AAGTTGTAAT 54 0 

TCGTCACTTT TACCTGTTAA GAAATAATAA ATTGAAAAGA ATGCAAAAAA TATCATATAA 600 

GGTACTATCA AACGTTTTGA AATTTTTTCT 63 0 

(2) INFORMATION FOR SEQ ID NO: 404:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6254 base pairs
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404:

ATTTAAACGT TCAGTTTCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC GTGAATTTTA 60

CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 120

ATAATTAATA CCTCTGTTGA CTCCCTCAAC ACGAATATTA ATTATATAAA ACAAACATCA 180

CAAGTTAGTG TCTGACACTA ATATGTGATG TTTTTTTGTr GTCAATTTTT AATTAAAAAA 240

AGTTATATAG TTTATAAATA ATCAAATTGA TATTCTATAG GTTCTTATAA CTATAAAGTA 300

TATTCAATTT CATGTATAAT TAATGTGAGG GCGAGGTGAA ATTGTGAGTT ATAATAATTT 360

TTTACAAATG ACAACTATCT TGGAATCAAC GGCTGGAGAT ACTTGGGTTG AACAAGTTAG 420

CAATATAATT GTTCAACCTA TTTTTACGTT AATATTAACC TGTTTGACAT TCTTAGGATT 480

TGTATATCAA CTTTACTCTA AAAAAATCAA TGCAGCTGGT ATTATCGCCA CATTATCATT 540

ACTTATTTTA TTTTTGGGAT TTCTAATCCA AGGAAATGTC AATATGCATT CTATCTTAAT 600

ATTCTCAATT GGCGTTATAT TAGTTGTAAT TGAATTATTT GTAGTTGGTG CAGTAATTGG 660

TATTATTGGC ATGATACTGA TAACTATAAG CATAACAACG CTCGGTGATA ATTTGCTATT 720

TATGCTTGCG AATGTTATCG TTGCCTTGAT TTTAACGATT GTAGAATGGG TGATATTAGT 780

GAAGATTTTC AACAGAAAGA TTCCGTTTTT GGATAAAGTT ATCTTAAAAG ATTCAACTAA 840

TTCTGAGTCA GGTTACAATT CTCATGATAA CCGCTCGCAC CTCGTAGGAA AGACTGCTCA 900

AACAGTTACA GATCTTCGAC CTGCAGGGAT TATTTTTTGT GAAAATGAAC GTATTGATGC 960

TGTTTCAGAT GGCAACTTTA TTTTGCGCAA TAAAACGGTA AAAATCCTTG AAGTTGAAGG 1020

AACAAGAGTA GTTGTGAGGG AAGTAGATTA ATTAAAAGGA GCGATACCAT GTTTAGTTTA 1080

AGTTTTATCG TAATAGCAGT TATTATAGTA GTTGCATTAC TTATTTTATT CTCATTTGTA 1140

CCCATTGGTT TATGGATTTC AGCGTTAgcA GCTGGCGTTC ATGTTGGTAT AGGTACATTG 1200

GTTGGTATGC GTTTACGTCG TGTATCTCCA AGAAAAGTTA TAGCGCCATT AATTAAAGCG 1260

CACAAAGCAG GACTAGCATT AACAACAAAC CAATTAGAAT CGCATTATCT AGCAGGAGGA 1320

AATGTTGACA GAGTTGTTGA CGCTAATATT GCTGCACAAC GTgcTGACAT TGATCTTCCT 1380

TTCGAACGTG CTGCTGCAAT TGaCCTTGCA GGACGTGACG TATTAGAAGC GGTTCAAATG 1440

TCTGTTAATC CTAAAGTCAT TGAAACACCA TTTATCGCAG GTGTAGCAAT GaACGGTATT 1500

GAAGTGAAAG CCAAAGCTCG TATCACAGTT AGAGCTAATA TTGCTCGACT TGTTGGTGGT 1560
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w 



AGTAAGCATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAAAAC AGTTTTAAGC 16 8 0 

AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 174 0 

ATTAGTAAAA ATATTGGTGC AGACTTACaA ACTGAACAAG CATTAGCAGA CAAAAATATT 1800 

GCACAAGCAA AAGCTGAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGAAATGAAA 1860

GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCATTAGCT 192 0 

ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAG ATTATTATAA TTTGAAAAAT 198 0 

ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 2 04 0 

GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 2100

CATATCAGTG ATCATTTCTA TCATTACTAC TATGCGCGAA AATAGTCATA AAGATAGACA 216 0 

AAATCAAAAG CCACCTCAAA AAACATCTAC CGATAATGAA CCAAAAAAAG GTGGCTTTTT 222 0 

20 TGAAGAAATT GAGCGAACGT TTAAAGAAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 228 0 

ATCATCGAAA CGAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 2340 

AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 24 00 

25 ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 24 6 0 

ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT CACTTAGAGA 252 0 

TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA TTGCTAAAAT 258 0 

30 

GGAAAAACGT G CT AG AG AT A TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT 264 0 

GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCGAAAAA GATTTGACTA AATCAGCGTT 2700 

CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC 2 76 0 

35 

AAAACAATTA TAAAATTTTT TGAAAACAAG CACTATCGTA ATGGTAGTTG CTTGTTTTTT 282 0 

TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTTAAATT 28 8 0 

40 ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 2 94 0 

AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 3000 

AATTGCATGA TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 306 0 

45 GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 312 0 

TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 3180 

CCATGCAAGA GGACAAGaAG TTGCCGTTAA AGGTACAAAA ATAGAAAACG T AG AAAAAG C 324 0 

GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 33 0 0 

AGATGTTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATT CAA CATC TGTTAGATTT 33 6 0 



50 
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GCAACGTATA TATGTTAATG CCATGAAAAA TAATGATTTA GTATTTGGTA TAGGTCCTGC 3480

TGGTACAGGT AAGACATTCT TAGCTGTAGT TTATGCAGCA AAGCAACTCC GTAAAGGTGC 3540

TGTTAAACGT ATTGTATTAA CAAGACCTGC TGTTGAAGCA GGAGAGTCAC TTGGATTTTT 3600

ACCAGGAGAT TTGAAAGAAA AGGTAGATCC ATATTTAAGA CCTTTATATG ATGGTCTATA 3660

TACTGTTCTT GGGCGTGAAC AAACAGAGCG ATTTATTGAA AGAgGCATTA TCGAAATAGC 3720

GCCACTTGCA TATATGCGCG GACGAACATT AGAAGATGCA TTTGTAATTC TTGATGAGGC 3780

GCAGAATACG ACACATGCGC AAATGAAAAT GTTTTTAACA AGACTAGGTT TTGGCTCAAA 3840

AATGGTAGTT ACTGGTGACC AAACTCAAAT CGATTTACCT AAAGGTGTTA AAAGTGGACT 3900

TAAGGAAGCG GTCAGTAGGT TACACAACGT TAAAGGTATA AGTATATTGA AATTAGATCA 3960

GAGCGATGTA GTCAgACATC CATTGGTAAG TAAGATCATT GAACATTATG AAGGAGAGAA 4020

TTAAATGTTT ACGATAGATT TTAGCGATCA CACAGGCTTA GTTAAAGATG CTTGGTATAA 4080

ACAAATTGAA GATTTATTAG AATTTGCTAA AAAAGAAGAG CATATAGAAG ACGATGCTGA 4140

GCTTTCTGTT ACATTTGTAG ATAAACAAGA AATACAAGAA ATTAATCGAA CATATAGAGA 4200

TAArGwTAAr GTTmCaGATG tAaTCyCaTT tGCTTTAGrA GrAGATGAGC CmGaGATkGA 4260

TtTTAGTGGT CTTGATATAC CACGTGTTTT AGGGGATATA ATTATCTGtA CGGATGTAGC 4320

GCAAGrACAA GCAAACAATT ACGGACATTC TTTTGAACGA GAATTAGGAT TTTTAGCATT 4380

ACATGGATTT TTGCATCTAT TAGGTTATGA TCATATGACT GAAGCGGATG AAAAGGAAAT 4440

GTTTGGTCGA CAAGATACAA TATTAAACGC ATATGGATTA ACACGAGACT AATTATGAAA 4500

AGGTTTAAAT ATGCACTTGA TGGGCTGAAA ATCTTAATTC aaaaaqacta TAAATTTCTT 4560

TTACATGTGT TTGCAATGAT TGTTGCTATT GTCTTTGGTC TCGTACTAAA TATTAATCGG 4620

ATTGAGTGGA TATTTATACT CATTGCTATT GCATTAGTTC TCACTGTTGA AGCTTTAAAC 4680

ACTGCTATTG AATATGTTGT CGATTTAGTG ACCGTTGAAT ATCATGATTT AGCTAAATAC 4740

GCTAAAGATA TTGCGGCTTT TAGTGTACTT ATAGTTTCAA TATTAGCATT TATTATAGGT 4800

TTAATAGTAT TTTTACCACA TTTTATAGCG TTATTTTAGG GAGGCATATA TGAGTTATCA 4860

ACCTCATTAT TTTCAAGAAG TTAGAAAAGC ACAACAAGAA TCATATTCGC CATACAGTCA 4920

ATTTAAAGTA GGGGCTTATT TAAAAmCGAA AGACgGTAGA ACTTTTTATG GTACCAATGT 4980

AGAAAATGCT TCTTATCCAT TATCGATATG TGCTGAACGA GCTAGTTTGG TATCGGCAAT 5040

TTCTCAAGGA TACAGACCAG GTGATTTTGA ATCAACAACT GTAACCGTAG ATGCAGATAA 5100

ACCGTCATCA CCTTGTGGTG CATGTCGTCA AGTTTTGAAG GAATTATGTG ATGATGATAT 5160
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ACCATTTGGA TTTTCAGGAA AGGATTTAGA ATAAATGACA GAACATAAAT CAGGATTTGT 52 8 0 

TTCAATTATA GGTAGACCAA ATGTAGGAAA GTCAACATTT GTTAATAGAG TGATCGGCCA 534 0 

5 TAAAATAGCA ATCATGTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT 54 00 

GACAAGAGAT GACGCGCAAA TT AT ATT CAT TGATACGCCA GGTATTCATA AACCTAAACA 54 60 

CAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 552 0 

10 

CATGTTTATG GTTAATGCCA ATGAGGAmAT TGGACGAGGC GATGAATATA TTATAGAAAT 5580

GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 564 0 

AGATGAATTA ATGCCAAAGA TTGAAGAATA TCAAAGTTAT ATGGACTTTA CAGAGATTGT 57 0 0 

15 

ACCTATTTCA GCATTAGAAG GGCTAAATGT CGATCATTTT ATTGATGTTT TAAAGACGTA 57 6 0 

TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 582 0 

20 TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTGAAG AAATCCCTCA 58 80 

TGCGATTGGT GTTAATGTGG ACCGTATGGT TAAAGAAAGC GAAGATCGTG TTCATATCGA 5 94 0 

AGCAACTATA TATGTTGAAA GAsGTTCGCA AAAAGGAATT GTCATTGGAA AAGGCGGTAA 6000 

25 AAAGTTAAAA GAAGTAGGaA AAcGTGCGAG ACGTGAtATA GaAATGCtTC TAGGCTCTAA 6060 
AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 6120
TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 6180 

30 

AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 62 4 0 

CACGATTTTA AATG 62 54 

(2) INFORMATION FOR SEQ ID NO: 405: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 60 

TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 120 

CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TGCTGATATA 180 

ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATTCCATAAA 24 0 

TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACT ATCTG CA 3 00 
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TGTATTTGTT CAGCTAATGG AAAACGACTA TTTATCGTTG AATGGATATC TCGAATTGCG 420

GCACCTGTAC TCGCTGTTAA AACTGCAATT TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 480

TTCGCTTTAT CAAAACAACC TTCTTCAGTT AATTTTTTCT TTAATGCTTC TAATTTTTGA 540

TATAAGTTCC CTATACCATC TAATTGCATT TTATTTACAT AAATTTGATA GTTTCCACGA 600

CGTTCAAAAA CAGAAACACG TGCTTCTAAT AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 660

TTTAATTTAG AAGCACTACC TTTGAACATC ATGGCACTTA TAACGCTTTC TTTATCTTTC 720

ACATTAAAGT ATAAATGaCC ACTTGaATGc TTTTTGAAAT TTGAAAGCTC ACCTTTAATC 780

AATACAGATT GGAGATGTGG ATCTTGATCA AATTTATATT TAATATATTT CGTTAAAGCT 840

GAAACACTTA AATAATCTGA CATATAACAT CACTCAATTT TATTTTTTTA TATTACTCAA 900

TACACCATTT ATAAATTTAT AATGATCATC ATCACTGAAT TGTTTTGTTA ATTCAACTGC 960

TTCATTCATT ACGACTTTAG CAGGTGTATC ACTGTGTAAT ATTTCATATG TTGCCATTCT 1020

TAAAATAATA CGATCCGTTT TTAATAAACG TGCAATAGTC CAATCTTTTA AATAAGGACT 1080

AATTGTCTCG TCTAATACAG GTTCGTGATC TTTAACGCCA GAAACTAGCC AATGAATAAA 1140

TTCGAAGTCT AAATCTGGAT TATCGTCTTT AATAAAGCTT ATCGCTTCAT TTATCGTTAA 1200

ATCACTGTCC TTCATTTCTA ATTGAAATAA AGTTTGAAAA GCTTGCACTC GGGATTCTTT 1260

ACGACTCATT TTTAACTCCT TCAAACGTTT GTATTTTTCT TTATTTAATT ACTGAATTAG 1320

GTATGACATT ACTTTTCAAT AACGATTTGT GTAATGTGAA TATTAATTTG CTTAGGTTCT 1380

ATCGCTGTCA TATTAGAAAT TGAATTAAAA ATTGACGTTT GAATTTTGTT TGCAGTTTTT 1440

GAAATATTAA CACCATGTTT TAATGCACAA TATACATCTA TATATATGCC ATCTTCTTTA 1500

CTCTCGATTT TTAAATCACG GCTTAAATTT TTACGACTAA CTTTTTCTAA ATTTGTTTCT 1560

TTTAATTCAG CAAAATGGCC AGTGATGCCT TCGACTTCCG AAGTAGCTAT ACTTGCAATA 1620

ACAGATAGCA CTTCTGGCGC TATTTCTACT TTACCTAATT TTGAATTTGA ATAATCAGTT 1680

ACTTTGACCA TGGATTGACC TCCTATTAAC CTTCATCATT CATAATGCTA TTTTGCTCTA 1740

AAAAGTTTGT ATTAAATTTA CCGCTTCTAA ATATATCGTT ATTCAATAAT TTAATATGGA 1800

ATGGAATAGT TGTATCAATA CCAAGAACCA CAAATTCACT TAGTGCACGA ATGCCAGCCA 1860

TAATCGCTTC ATCTCGTGTC GGTTCATGTA TGATTAATTT CGCTACCATC GAATCATAAT 1920

ATGGCGGTAT CGTATAATTA GTATAACATG CTGACTCTAT TCGAACACCA TATCCACCTG 1980

GTGCAAGATA TTGCTCmATT TTACCTGGTG ATGGCATAAA GTTCTTGTAA GGATTTTCAG 2040

CATTAATTCT AAATTCAATT GCGTGTCCTG TTAATTTAAT ATCTTCTTGT TTATACGGTA 2100
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CAGTTACAGG ATGTTCTACT TGAATACGTG TATTCATTTC CATAAAATAA AATTTATTAT 2220 

CATTTAAATC ATATATAAAC TCAATTGTTC CCGCATTTTC ATAATTTACA GCTTTCGCTG 22 80 

CACGAACTGC GGCATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 2 34 0 

CTTCTTCCAC TAATTTCTGC ATACGTCTTT GAATTGTACA ATCACGTTCT CCTAAATGAA 24 0 0 

TTACATTACC ATAGCTGTCC CCAACAATTT GGATTTCAAT ATGGCGGAAG TTTTCGATGA 24 6 0 

ATTTCTCCAT ATAAAGTCCA CCATTACCAA ATGCAGTTTG AGCTTCTTGT TCTGTCATTC 2 52 0 

GGAAGCCAGT TTCAAGTTCT TTTTCATCAC GAGCAACACG GATACCTTTT CCGCCACCGC 25 80 

CAGCAGTAGC TTTAATGATG ACCGGATAGC CAATTTTTTT GGCGATTTTC TTAGCTTCTG 2640

AGACGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATTT 27 0 0 

CTGCCTTAGC AACATCTTTG ATACCCATTT TTTGGATAGA TTGATAACTT GGTCCAATGA 27 6 0 

20 ACTTCAATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 282 0 

AACCCGGATG AACGCCATCA CAACCTGTAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 2880

TTAAATATGA ATCTTTAGAC AAAGTGGGAC CTACGCAATA TGCTTCATCA GCAATTTGAG 2 94 0 

25 

TATGTAGCGC ATCTTTATCC CCTTCAGAAT AGATTGCAAC AGTTTGGATG CCTAAATCAC 3 0 00 

GACAAGCGCG AATAATCCTA ACTGCGATTT CACCGCGGTT TGCAATTAAA ACCTTTTTCA 3 060 

TTATTTCACC TTAAATAACG GTTGGCCATA CTCTACCATT TGTCCGTCTT CTA CTAAGAT 3120 

30 

TTCAACAATT TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCCTCTAA 3180

AATACACACT GTTGTTTCAT TTGAAACAGT GTCCCCAACT TGCACATATG CTTCTTCGTC 3 24 0 

35 TGGAGATGGC GATTTGTAAA ATGTACCTAC CATAGGTGCA TTAATTGTTT TGTGATTATC 3 3 00 

TGAAGTTGGC TTTGGAGCTT CAGTTTTATT GCTATCAGTT GATTGTGCTT GAGGCATAGG 3 3 60 

CATTGCCGCA GCTTCAACTG GCATTTGTGA GATTTGTGGC GTGATAATCT CAGTTTCTTT 34 20 

TTCTTTCTTA AG CGTCACTT TGCCTTTAGT ATCTTCAATA TTGATTT CCG TTAAAGTTGA 34 80 

TTTATCCAGA ATTTCAATTA ATTCTTTGAT TTCTTTAAAG TTCATTATTA CTGACTCCTT 3 54 0 

CAGTTTGTTT TCATCTACCC GTCTATTTTA CTTGAGACAA CTCTTCAATT CAAGCATGTT 3 6 00 

CATATTGCTG GCGACATTAT AAGTCTATCC CAAAGTTATA ATAAAACCAC ATTTTAAATT 366 0 

AAAAACACTT GTGTATTTAT TACTTAACAT TGACTCATCT TAACTCTTGA 3 710 

(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1705 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 



5 



10 



15 



20 



25 



30 



35 



40 



50 



GCTGATGTTC 


GTTGCCTTTn 


TCCACCAGAC 


AATTCAGAGG 


GATATTTATC 


ACTAATATCC 


60 


AATATATTTA ATGCTTCTGC 


TACTTTTTCA 


TAACGATTTA 


ACATATGTTC 


TTTATCTAAC 


120 


ttptpta ccci TTAfyTY^VTAA CATTATGTTT TCTTTAACAG TCAATGTATG CAGTAAATTA 180

TAL I L 1 i L?AA AAATAAAAPP AATATCATGC TTGCGTATAT CAGATwATTC CTTGTTTGAA 240

AGL 111 ILiA Ai ill 1 1 1 LL TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 300

L J71 AAAALA i TTAATAATPT CGTTTTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 360

TCGCCTTLT1 LAA I ALs ALA 1 ATTGATATCT CGCAACACTT CTTGTGCCAT TTTTTTAGTT 420

CCATATATTT 1 ILfI 1AA1 iv TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 480

ATCTTAACTT LAI I ILL 111 AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 540

CATCTATCTT ATT A I AALlj 1 TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600

ATGTTTAAAA TGALAAAIII GTCACTTCCG ACATGCGTTC AACAATTTCA TTTTGTAATG 660

GGAAAATCAA TL 1 GAlALj 1 1 GTCCCCTTAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 720

CTAATTGATL L 1 1 1 ALAL 1 A TTTACTAAAT ATAGACCCAT ACCTGAAGAC GTCGTTTCAT 780

TTCTGTTAGC LL? 1 1 LjALLj 1 A AATCCTCGTT CAAATATTCG CGGCATATCT TTTTTACTAA 840

TACLTL ILrLL A 1 Au 1 L 1 1 1 A ATATATAACG AAACATGTTG ATCATTTAAT TCTGTCCCAA 900

I 1 J. AAA A TTPTr& CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 960

IALbAtALLA 1 J. 1 IrtlrtlUi GTATAAACAT AATCATCCAC TTTAAAGTCA ACATCAAAAC 1020

LAA1 ALL 1 1 1 AAPPTPAPTA ATATGTCTTG TTAATTGTAT TTCATCAATG ACCATGCGTT 1080

TnAPTrapap PTAATPAAAA TACATATCTT TACGTTGAGA TTCTAATCTA GTAATATACA 1140


GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200

CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260

GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320

TTTCTTTGTG CGCTGAGATT TGACGATATA AATAATCAAC TGTATGACGT TGAAATGGCG 1380

TTTCCGCTAA ATCTTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 1440

ATAATTTTAC TTCTTTAAAA TATGTCAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 1500

AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 1560

ACATAAGGAA GTTTaAAAAC AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620

GCACTATCCA TACTAATTTC AGATA 1705
(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1722 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407:

TCAATATATC TTAAAATTCA ATGATTAAAT CnATTATCAC TAGACATnAA ATACATAAAT 60

CCTATTCCCC ATTTTCATTT nTTAATTCAT AAATGAATCA ATAACCACCT AATAACAAAT 120

CATATTATAC ACCTTTGTTC TCTATTTTTC TAAGGTTTAA AAAATATTTT TAGGTAAACC 180

TAAAAATAGA TGTAATAAAA ACGCCTCCTC AGATATTTAT ATATCTATGA AGACGTTTAA 240

ATACATTATA GATGGTCTGG TTCTGGGTGA ACGTATACTG AGGAAAtACC TTTTTTGTGC 300

AAATGATGTT CGACATTGTC ACAAATTTGA TGCGCTTCTA CTAAGGAAAG GTTAGCATCT 360

ACAACAATTG TGACATCAAT AAACACACTA CTTCCATGGT AACGCCCTTT AATACTTTTA 420

ACTTCTTGTA CTTCATCAAC TTCTAAAATA TCATTGCGAT ACGCTTCTAA TTCAGTTTCA 480

TTGAAACCAT CACTCAACAT AAAAATTGCT TCTTTAAAAA TACCAAAACC AGTATAAACG 540

ATTAGTAAGC CTAGTAATGT TGCTAAAATA ATATCGACAA TTGGGAAACC GATTTGCGTA 600

AAAATTAATC CTATCGCTGT TCCAATGCTG ACTAAACTAT CCGATAAATT ATCTTTGGCA 660

GCCGAATTTA AAGAACTACT TTTCGTTCTT TTCGCTAGTC TTTGATTGAC TGCAAATACA 720

ATCAACATTA CAAGACCACT GATTAAGCTG ACGATAATTG TTATTGCGTT AGGTACAACG 780

TCATCTTCTT TGAACAAACG AGGTGCATTT TGAATAACTA CTTGGATACC TACAAACATA 840

ATGACAAATG ACACCAATAA TGAAGAAATA TTTTCAGACT TCAAATGGCC ATAAGGATGA 900

TTTCGATCGG CAGGTTTAAT TGAAATTTTC AATCCAATAA TAACAGCTAA AGAAACGATA 960

ATATCTGTCA TATTGTTTAA TGCATCGGCT CTTACAGCTG CAGAGTTAAA GACAAAACCC 1020

GTGACATACT TAACAATAGA TAAGATTATA TATACAATTA AACTCAAATA AGCACCGCGT 1080

TGCGCCAATT TAAGATTTTC ATTATGAGAC ATGCGTTGAA CCACCTTGAA TTAGTATAGT 1140

AACAATATTA TGAATGATTC ATTTTAATTT TACAACGTTT TTAATTTTTA TAAATTTTTA 1200

TAAAATTAAA CTAATTTATT CATTCGCAAC CCCTAAAAAT AATTTTTAGC CTTTCTGCGA 1260

ATTTTATGAG CTAGAAAGGC GCCCAACTCT CCCTGTTTGT TAACTTTCGC CTCGAAAGTT 1320

CGAAwvTTAT GAGCTAGAAA GGCTTATGCA GTTGACGTTT TACGTCCAAC TGCGTTCCTC 1440

CGTCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 1500

CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 1560

CTAATTATAT AGAAGAAAAA GTGCATCAAT GACAAATTAA ATGAGATTTC TACTCTACCA 1620

AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 1680

TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 1722
(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:

GCGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 60

GAAAGCGAcC ATTyCAATAC GAaTTgTGAt AAATAGAGAA CAGCAGTAAG ATATTTTCTA 120

ATTGAAAATT ATCTTACTGC TGTTTTTTTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA 180

TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAGA TTGTATTTAA 240

TAAGTTAATG TAATCCTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 300

CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 360

ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 420

ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 480

TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 540

GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAACGGTAGA ATAAAAGGCT 600

TTGTTAAACA CAGACGTTTT TGAATCAGTA AGAATTCTTG ATTTTTCATG GAAATGAGAT 660

GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGTCTCTA 720

ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 780

GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG CAATGCTTTA 840

CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 900

TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 960

AAGCTTGAAG GAATCAATGG TGTGACTTCT TGTGGAATCA TTTGGTATAT ATTTTTTCTA 1020

AAAAATGAAT ACCAGCAACT TCATCAATAA TTGTTGATGC ATAGGACTTA AATATGACAT 1140

CTTTCTCCAA AATATCATTT AAAATACGTG TCATTAGTGG ACCATTGTGC GTCGTTTGTT 1200

CTGATAATGT ACGAATCTCA CCTGTAATAT GAACGTTTGT CGACAATTTG ATGTGTGGCG 1260

ACATAGCTGG GTATTTAGGA ACTAATGTTC TGAAAGATAA ACCAGCATAA TAATCCaACG 1320

TATGTTTTGC TTCAATGATT AATTCTTTAT CTACTTCTGC TTGATAATCA GAATGTAATA 1380

CGTCATCTAA TTGCCATGGA TGAACAATCA TAATGTGATA ATCATTAAAG TTAAACTTTG 1440

GCGTAAATTC ATTTTCTAAT TGTTTAATTA AGTCCGGAAA TAGTTGATGA ACAGTTGTAT 1500

CATAATCTTT AGACAGTGAC ATAGTACGGC TTAATTTACT GTGAATCAGT ACTATTTTCA 1560

ACTTAATAGG TTGATTGAAT TCTGAAGAAT ATAGGAATGT TTGTAATGCA TTTAAACCTT 1620

TACGTAATTT AGCCCCAGGA TGTAGCGGAT GACCTTCAAT AACGGCTTGC TCTGAACGCA 1680

AGTAACTATC TTCGCTATTT TCGATAATAT TAAATAAAGG TGCAGAATCA TGTTGCATTG 1740

ACAGTGCTTG ATAGCTAATT GCAAATGTCA TATtAGTtGC ACTGTTTATT AAATCTTGCT 1800

GAAATTGATC ACTAGCAGCA TTTTTTAAAT CTGGTGCTTC AATTAAAATA CACTCAAGAA 1860

TTTCATTTGG ATGGTGTACT CGTGTAATCG TATTTGTAAT GTCATCTTTA ATGTAGAAAG 1920

GGCCTTCAAC ATCAATTCGA TCAAAGGCGT GTTCTCCAGT GATAGGAGCA TATAATGTTT 1980

GCTTAGCTTG TGGGAACTGG ATTTCTAGTA TATGAGTCGT TGAGATATCT AACATAATCA 2040

AATCACGACT CAATATTTTC TTACTTTGAG TGCGCGCTTT AACTAAGTTT TCGCGATGCA 2100

TTGATGTGAC CAATCTCTGA GTGACTTTAT CTCTTCCTTG TAAAATCATC TCTTTAAAAA 2160

TATTAGCCCA ATCGCTATTA TGTTGTTGTA AAAATAAATA TGTTTCTTGT TCTTCTTTGT 2220

TAAATTTTAA TGTCTGTTCT TTAAAAATTA AGTTCAAGTT CATAATTCAC CTCTATGAAA 2280

TATTTTACAA AAGCAAGATA GATTTGTATA ATCCATATTA ATGATAATGA yTCTTATTAT 2340

CAACAGAATG CGGGTGTAAG TTTTATGACA AAATATTTTT TTAGCAGTTC TTTTCTACTA 2400

TTTCTAGGTA ATTGGATTGG ACAAATAGGG CTAAATTGGT TTGTACTTAC CACTTATCAT 2460

AACGCAGTTT ATCTGGGGAT TGTCAATTTT TGCAGACTTG TACCAATATT ATTACTAAGT 2520

GTGTGGGCAG GGGCAATTGC CGATAAATAT GATAAAGGGC GATTGCTGAG AATTACAATT 2580

TCATCATCAT TTTTAGTAAC TGCAATTTTA TGTGTGCTCA CGTATAGTTc ACTGCAATTC 2640

CAATTAGCGT CATTATTATA TATGCGACAT TAAGAGGGAT TTTAAGTGCG GTTGAAACAC 2700

CTTTAAGACA AGCAATCTTA CCAGATTTAT CAGATAAAAT ATCTACTACA CAAGCTGTmw 2760

CATTTCATTC ATTCATCATT AATATTTGTC GTTCAATAGG GCCTGCCATT GCTGGTGTCA 2820

CAGTTTTATT ATGCTTACCA TTACATTITA AAGTAACTAA AATACCTGAA GaTGCATCAA 2940

GaTACATGCC GTTAAAAGTT ATTATAGATT ACTTCAAATT ACATATGGAA GGTCGACAAA 3000

TATTTATAAC ATCATTATTG ATTATGGCGA CAGGTTTTTC ATATACGACA CTTTTACCAG 3060

TTTTGACAAA CAAAGTATTT CCGGGGAAAT CTGAAATATT TGGTATCGCT ATGACGATGT 3120

GTGCCATTGG TGGTATTATT GCAACGCTAG TTTTACCTAA AGTACTTAAA TATATTGGTA 3180

TGGTAAATAT GTATTATTTA AGTTCATTTT TATTTGGCAT TGCTTTGTTA GGTGTGGTAT 3240

TTCACAATAT TGTCATCATG TTCATTTGTA TTACATTGAT TGGGTTATTT AGTCAATGGG 3300

CACGTACGAC AAATCGCGTT TATTTTCAAA ATAATGTTAA AGATTATGAA CGTGGTAAAG 3360

TACTGAGTAT TaTTATGATG GgATAGAGGT ATGaTTCCAT kGGGAAGTCn ATTAATGAGT 3420

ATATGTGCAG ATGTGTTTGG CATTGTTAGA ACTTTTTCAA TAATGGGAAT AAGTACTATA 3480

TGCATTACAA TGGTATTCTA TTTTATAAAT AGAAAGTTGA AGTTAAAGTT GGAGGAAAGT 3540

AATCATGGTA TATCTTGAAT GGGCAAAGGC AGATAGAAAT ATTCAATATC GTGTAATTAA 3600

CGCCATTATT AAAGAACGTA TTTACCCCGA GCAAACATTT ATTTCGCAAA AAGGATCTTT 3660

AATTGAAATT CAGTATCATA TGCATGTGTT GACTATTGAA GTTGTTAGAA AAAGTGCATT 3720

AGAACGCTAT GAGTTTACAG GTGATATTAC TTATTTAAAT AAAGGTGAAA CGTCATTAAT 3780

TATAACTTTA GAAGGTTTAT TAGATGTGTT GAATCATGAC TTTGATATCC CTATTTCAGA 3840

GCGACTACGC GAAGAGTTAA TACACAGTCG AGATAGTTTA GTTGAAACAT ATAAGCAAAT 3900

GTCTCACAGA CAAACGTTAA TAAGTCmAAG TTTTAAATTT TCAAGGTTAC CACAAGATAT 3960

TAACTTTTTT TCakGGTtAC AACATGTAAA AGATAGTGAT AAGACAGATG ATTTAACTTA 4020

TTCTGAGAGT TTGGTACCAG AGGGGCATCC AACACACCCT TTAACCAAAA CGAAATTGCC 4080

CTTAACTATG GAAGAAGTAC GAGCATATGC ACCTGAGTTT GAAAAAGAAA TCCCTTTGCA 4140

AATTATGATG ATTGAAAAAG ACCATGTTGT GTGCACAGCT ATGGATGGTA ATGATCAATT 4200

TATTATTGAT GAAATAATTC CCGAATACTA CAATCAGATT CGTGTGTTTT TAAAGAGTTT 4260

AGGTTTGAAA AGTGAAGACT ATAGAGCGAT TTTAGTACAT CCTTGGCAAT ATGATCATAC 4320

GATAGGGAAA TATTTTGAAG CATGGnTTGC TAAAAAAATA TTAATTCCAA CGCCGTTTAC 4380

AATACTTcCA AAAGCaACTT aTCatTTaGG ACGATGTCTT TAATTGATAA ACcATACCAT 4440

GTTAAgTTGC CCgTCGATGC aCAAGCAACA AGTGCCGTTA GAACAGTCTC AACTGTGACT 4500

ACTGTAGATG GACCAAAGTT AAGTTATGCT TTACAAAACA TGTTGAATCa ATATCCaGGA 4560

TTTAAAGTTG CTATGGAACC GTTCGGTGAA TATGCAAATG TTGATAAAGA TAGGGCACGT 4620

AGTGCAAGTC TAGTTAATAA AAATCCAATA GATCAAAAAG TTATCGTGGA TAGTTACTTA 4740

GAGTGGTTAA ATCAAGGAAT TACTAAAGAA AGTATTACGA CATTTATTGA ACGATACGCT 4800

CAAGCATTAA TCCCGCCTTT AATTGCTTTT ATTCAAAATT ATGGAATTGC TTTAGAAGCA 4860

CACATGCAAA ATACAGTAGT GAACTTGGGG CCACATTTTG ACaTTCAATT TTTAGTGAGA 4920

GATTTAGGTG GTTCTAGAAT TGATTTAGAA ACATTAGAAC ATCGTGTATC AGATATTAAA 4980

ATTACAAATG ATAGTTTAAT AGCTGATTCT ATAGATGCAG TGATTGCAAA ATTCCAACAT 5040

GCTGTTATTC AAAATCAAAT GGCAGAATTA ATCCATCATT TTAATCAGTA TGATTGTGTT 5100

GAAGAAACCG AATTATTTAA CATAGTACAG CAAGTAGTAG CGCATGCCAT TAACCCAACA 5160

CTACCACATG CAAATGAGTT AAAAGATATT TTGTTTGGAC CAAGAATTAC TGTCAAAGCG 5220

TTGTTAAATA TGAGAATGGA AAATAAAGTA AAGCAATATT TAAATATTGA GTTAGATAAT 5280

CCGATAAAAA AAGAGGTGTA GTACTACATG GCACACGTTA ACATAAATAT ATCGAAGATT 5340

AAaTATAACG CCAAAGTACT TCAAACAGTT TTTCAAAGTA AAAATATGCA ATTCACACCA 5400

GTAATTAAGT GCATAGCTGG TGACCGTACA ATTGTAGAAA GCTTAAAAGC GTTAGGTATC 5460

AATCATGTTG CAGAATCCAG ATTGGATAAC ATAATTAGTA TTGCAGATAC AGGATTTAAC 5520

A 5521
(2) INFORMATION FOR SEQ ID NO: 409:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1261 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:

AGGTCGTCTr GCaGmAGmGm TATTTGACGA TCTTGCTTTC CCAAAACACG ATGATGATTT 60

TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT 120

TGATGATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 180

ATAGTTAGAA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 240

TGTGACTAAA AATTAATCAA TTATAGTGAA ATATGGTGCG CTATCTTGCA TAAATTGATA 300

TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 360

GATAAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 420

GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA 480

CAAAAAATAA GTGGTTTAAA CAAAACTGAT CAAGCAAACT TAAATAAAAT TGAAAATGTG 600

TATAAAATCT TAAATAGTGA TTATTACAAA AAACAGGACT CTGACAAGTT AAGTAAAGCT 660

GCAATTGATG GCATGGTCAA AGAATTAAAA GATCCTTATT CTGAATATTT AACAAAAGAA 720

CAAACGAAAT CCTTTAATGA AGGTGTTTCA GGTGATTTTG TAGGTATTGG TGCAGAAATG 780

CAAAAGAAAA ATGATCAAAT TATGGTTACT AGTCCTATGA AGGGATCTCC AGCAGAACGT 840

GCTGGCATTC GTCCTAAAGA TGTCATTACT AAAGTAAATG GAAAATCAAT TAAAGGTAAA 900

GCATTAGATG AAGTTGTCAA AGATGTTCGT GGTAAAGAAA ACACTGAAGT CACTTTAACT 960

GTTCAACGAG GTAGTGAAGA AAAAGACGTT AAGATTAAAC GTGAAAAAAT TCATGTTAAA 1020

AGTGTTGAGT ATAAGAAAAA AGGTAAAGTT GGAGTTATTA CTATTAATAA ATTCCaGAAT 1080

GATACATCAG GTGAATTGAA AGATGCAGTT CTAAAAGCTC ACAAAGATGG TTTGAAAAAG 1140

ATTGTTTTAG ATTTAAGAAA TAATCCAGGT GGACTACTAG ATGAAGCTGT TAAAATGGCA 1200

AATATTTTTA TCGATAAAGG AAAAACTGTT GTTAAACTAG AnAAAGGTAA AGATACTGAA 1260

G 1261
(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:

AAATATATTG AAnAGAnAAT TACTAAGATT AAATCnTCTT AAAATATCCC TGAAATAACG 60

TCCTAAAGAT TAAAGGAAAG AGGTTATAAG TTATGCCAAA ATTAATTTTA TGTCGTCATG 120

GACAAAGCGA GTGGAATGCT AAAAACTTAT TTACTGGATG GGAAGATGTT AATTTATCTG 180

AACAAGGTAT TAATGAAGCG ACTAGAGCAG GTGAAAAAGT AAGAGAAAAT AACATTGCCA 240

TCGATGTAGC TTTTACATCG TTATTAACAC GTGCTTTAGA TACAACGCAT TATATTTTAA 300

CTGAATCTAA ACAACAATGG ATTCCTGTAT ATAAAAGCTG GCGTTTAAAT GAACGCCACT 360

ATGGTGGATT GCAAGGCTTA AATAAAGATG ATGCTAGAAA AGAATTTGGA GAAGAACAAG 420

TACATATTTG GCGTCGTTCT TATGATGTGA AACCACCTGC TGAAACCGAA GAACAACGTG 480

AAGCTTACTT AGCTGATCGT CGATATAATC ATTTAGATAA ACGTATGATG CCTTATTCTG 540

AAAGTCTGAA AGATACTTTA GTTCGAGTGA TACCATTTTG GACAGATCAT ATTTCACAAT 600

TTAAATATCT TGAAGATGTG TCAGATGAAG ATATCATTAA TTATGAAATT AAAACAGGTG 720

CACCGCTTGT TTATGAATTA ACGGATGATT TAGAAGTTAT AGATAAATAC TACTTATAAA 780

AaAAGAGCTG CATGTACACA AGGAGTGAGT GTATATGcAG CTCTTAAAtA TGTGAAGTAA 840

TGTAAGGAAA TAGTTAAGTA TAGAGTTTAT ATTAACGAGC TAGGGATACT CGAAAATATA 900

GTTAGACATA CAATATAGTC AAATTAAAAC AATTATTTCG CTCTTTTATG TTGCTTAATA 960

ATCTTTAAAG CACGCTTTCT TGTTTTAATG TTAGGGCTAT TTAAATTACG ACGAGCAGTC 1020

TGTAAATCTA ATTTCATCTC TATCCCTCCT TGTAAATATA TTATGACCGA TAACTACTCA 1080

TATGTAAATA GTAATGATTA CGTTTTAAAG AAATTGTAAT AAAGTCGTGC TAATTTTTTG 1140

GAAAATGGGT ATAATTACCG GATATCTAAA AATGTGTGTC GTTTTTTAGA TGGTGAGGGG 1200

GAAGCTTTAA ATGTCGAAGA AACAAAAATT AACGATGATT ATTACTATGC TGATGGGTGG 1260

ATTTTTTGGA TTATTAAATG AAACACTATT AGTGACGGCT TTACCAAGTA TTATGAAAGA 1320

TTTTGAAATT TCATATACAC AAGTTCAATG GCTGACAACA GCTTTTTTAT TGACTAATGG 1380

GATTGTTATT CCTTTGTCCG CGCTTGTTAT ACAACGTTAT ACAACAAGAC AAGTGTTTTT 1440

AGTGGGTATT TCTATCTTTT TCTTAGGTAC ATTACTCGGC GGCTTGAGTC CGCACTTTGC 1500

AACATTATTA GTTGCTAGAA TTATTCAGGC GTTAGGCGCA GGTATTATGA TGCCATTGAT 1560

GATGACAACG ATTTTGGATG TTTTCCAACC ACATGAACGC GGTAAATATA TGGGGATATT 1620

TGGTTTGGTA ATTGGTTTAG CACCAGCTAT TGGACCTACT CTTTCAGGTT ACCTTGTTGA 1680

ATATTTTAAC TGGAGATCGC TTTTCCATGT TGTCGCTCCA ATTGCAGCTG TGACATTTTT 1740

AATTGGaTTT AAAAcGATAA AAAATGTTGG AACTACAATT AAAgTACCTA TTGATTTTAT 1800

TTCTGTCATT TTTTCTGTAC TAGGTTTCGG CGGGTTATTG tATGGAACGA GTTCaATTTc 1860

AGAAAAAGGT TTTGATAATC CTAcGtATTA GTATCTATGA TTGGAGGCGT TGTTTTAGTC 1920

GCATTATTTG TAwTACGTCA ATATCGGCTA TCAACACCAT TATTaAATTT TGCTGTATTT 1980

AAAAATAAAC AATTTACAGT TGGTATCATT ATTATGGGTG TCaCAATGGT ATCGATGATT 2040

GGTTCGGAAA CGATTTTACC TATCTTTGTG CAAAATTTAT TGCATCGTTC AGCTTTAGAT 2100

TCTGGATTAA CTTTATTACC AGGAGCAATT GTTATGGCAT TTATGTCGAT GACTTCGGGT 2160

GCTTTATATG AAAAGTTTGG TCCTAGAAAT CTTGCTTTAG TAGGTATGGC GATTGTTGTT 2220

ATTACTACGG CTTATTTTGT TGTAATGGAT GAACAAACAT CAACAATCAT GTTGGCAACA 2280

GTTTATGCGA TTCGAATGGT TGGTATCGCG TTAGGATTAA TTCCAGTAAT GACCCATACG 2340

ATGAATCAAT TAAAGCCAGA AATGAATGCA CATGGTTCAT CTATGACAAA CACAGTACAA 2400

AACTTTTCTC CAACTATGTC AGACTATA 2488

(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:

TTTTACAATT TCAGATATCT CTAAAACACA TACCGATCCA ATGGCCAAAA TTATTCGTCA 60

AAAATTGAAA AAACTAGGAA TTCGTAAAGG GATTCCAGTT GTATTTTCAG ATGAAAGTCC 120

AATTGTCATA AGAGAAGATG TAAAAGATAT AGTTGGAGAT AAAAATGCTA TCAATCGAAA 180

AGGACAAATG CCACC1TCTT CAAATGCCTT TGTGCCAAGT GTTGTTGGAT TAATTTGTGC 240

AAGTTATGTG GTGAATGACG TATTAAAAGA TATTCCAGTT CGTCGCATTA AAGACAAAGG 300

GCAATAATTC ATTTTGAAAG GGATAATTTT CAACGTAAGG CAAGTGTAAC CACACCATAA 360

AACTAAAACT GACTAGTTCG CAAGCATAGT AGAATATGCC TCGTGTACTA GTCAGTTTTG 420

ATTTGATTAT AGCTAAAATA TCAAAGGTAA TCGATTTAAT CTTATTTTTA CGGCCTTTGT 480

TTAAGTAAGT TGTTATATAT TTCTTTGAGT TGTTGTTCAC TTTTAGATGT CGTTTTTGGT 540

TCGTAATAAA TTTTGTTTTT TAGTTTATCA GGTAAATATT GCTGTGAAAC ATAGCCATTA 600

ACATATTGGT GTGGATATTT GTAACCAATA GATCGGCCTA GATCTTTAGC ACCTTGATAA 660

TGTCCATCTT TTAAATGGTT TGGAATTTGG CCCACATGAC CGTTTCTAAT ATCGGACAAT 720

GCACTATCAA TGGCACTCAT TGCTGAATTT GACTTAGGTG ATAAGCATAG TTCGATTACT 780

GCTTGGCTTA GTGGAATTCT AGCTTCTGGT AAACCTAGAC GTTCTGCTGA TTCAATAGCA 840

GCAAGTGTTC TCTGACCAGC ATTAGGTGAG GCTAAGCCTA TATCCTCATA GCTAATTACA 900

AGTAATCGTC GAACTATTGT AGGTAAATCT CCAGCTTCAA TTAATCGTGC TAAATAATGT 960

AAAGCGGCAT TGACGTCGCT ACCACGGATA GATTTTTGGA AAGCGCTCAT AACATCGTAA 1020

TGCATGTCAC CATCCTTGTC ACTTACAAAT GCACCTTTTT GTAAACAGTC TTTAGCATCT 1080

TGCAATGTAA CATGTCGATA ACCGT 1105
(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412:

TAACGTCGAA CTTGAGCTGT TACGTTATGA CTCATAATTA TTTTAGCATA GTCATTTAAA 60

TAAACTTCTG TTCTGTCTGT TGGATAAGCA AATTCAAGCA ATTGACTGTA ACTATCATTA 120

ATGACTTCTT GATCAACATG ACTATCAAAA TATACAGCAT AATAATAAGT ACCATCAACC 180

ATATATAACA AATCTTCAAA CTCTGTAGTT ATTGGATTGC TATGATATGC ATAATTAATA 240

ACATCTTCTA AATCGTTAAA TTTCACAATG ATTGTTCTTG TATTTTTACG TGCTGAAGAC 300

TTTTGACGTT TAGAACCTTG AGCTTCTTTT TCTTTTGTTC GTTGCTCGAA TAATTCTTCT 360

AATTGATCTT CACCTTCTAA TGTTTGAGCT AACAATTCTT GAACTTGTTC ATCAAATkGA 420

TCAGTTGCAT CATCATCAGA CATATTCATC ATATCTTCAT TTTTAGATTT AGAAATTGTG 480

ACTTCGACAC CTTTTTCAAA GGCATGTACT TGAATCCATA ATGGACCTnC TACAACAAAA 540

TCTTCTACTT CGTTAATTTC ATCCATCATT GAnCAAAAG 579

(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1342 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:

GTGGTGAAAC TTCTTGCTTT GTAATTTTAT AAAGTGGATC AATATCACTC TGACTTACAT 60

CAGGTTGAAC TGTCATCTTT TTAGTTACTT TGTTTTCAAG CGTAATATTT GCTAATTcTA 120

GGtTTTACAG AATAATGAAT GTCATCAGCT AmCCCTTTmm CtTGATACTy CTCTTGATAT 180

TTTCCAGCTT TAGCATTTGA TAAATCAATC ACTACTCTTA AATCTTCTGG ATTTTCAATT 240

TTTATTATCT TTGATTGTGG TCCTGAAATA GTCACATTAA CTGTTTCAGG CGCTTTTGTT 300

AAATGCAAAT CTTTAGTGTT ATAAAGAATT TCAACGGGTA CATCTTGAAT CGTTTTACTA 360

GACTTTTGAC CAAGATTACC AGTGTTAAAG ATATTTCCAA AAACATTGTT AACAGATAAA 420

AAGAAAAACA ATGCCAAAAG AAAGGCAATA AATCTCAAGC CCCATTTACT TTCTAGCATA 480

TTATTTCACA CCTTTCTTTT GAAAGCGTGT GCCAAACCAA TGTTCAGCAA GCAACTCTTC 540

AAAAATTTCG TTTGAAATGT CTCGTCGTAA TTTTCCATCA AATGTTACCG AAATATCACC 600

AGTTTCTTCA GATACAATAA CGGTAAATGC ATCAGATACT TCTGAAATAC CAACCGCAGC 660

TGCTGCTGCT GCAATCTTCG TGCCTTGAAT AATCATTGCA CCATCATGTA AAGGTGTGTT 780

AGGTATAAAG ACATTAATTA AAAGTTCTTG CGAAATATTT GAATCCATTG CAATACCTGT 840

TTCAATATAA TCTTGAAGAC CTGTTTCTTT TTCAAAGACA ATTAATGCAC CTATACGTCT 900

TTTAGCCATA TATTGCACAG CCTTTGAAAC CGATTGAATC AATTTCTCTT CATCTTTACT 960

ATACGTATTA GAAGTATAGC GTTTTAAAAA GCTACCTCTA CCAAGTTGTT CTAACGCACG 1020

TCTAATTTCT GGTTGGAATA TTACTATTAA AGCTAATACC CCCCATTGAA TAACGATATC 1080

GAATAATTTA GATGTTGCAG TCAAGTTCAA TATCATACTT ATCTGCTGAC CAATAACAAT 1140

TACTAATATC CCTTTAAGTA ATTGTATCGC TTTAGTTCCC TTAAAGACCG TGATGAGAAG 1200

ATAAAGTACA TACCAAACTA TCAGTAAATC AAGGATACTC GTTACAATTT TTAACGTACT 1260

GAGGTTTTGA AAAAAGTTGG AAAAATCCAT AACATCTCCT CCGGGTAATA TTTTTCCATA 1320

ATACCCATTA TACCAATCAT TT 1342
(2) INFORMATION FOR SEQ ID NO: 414: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:

CTGTATAAAG ATGGAGGTGT TTTGCATGGT AAAACGTACT TATCAACCAA ATAAACGTAA 60

ACATAGTAAA GTTCATGGTT TCAGAAAACG CATGAGCACA AAAAATGGCC GTAAAGTTTT 120

AGCGCGCCGT CGTCGTAAAG GCCGTAAAGT TTTATCTGCA TAAGATCACT GACTCATCAG 180

TGAtCTkTTT TTTCGTTTAA ATTAAGAATA AATAGAAATT TATGTTATAA GCTCAATAGA 240

AGTTTAAATA TAGCTTCAnA TAAAAACGAT AnATAAGCGA GTGATGTTAT TGGAAAAAGC 300

TTACCGAATT AAAAAGAATG CAGATTTTCA GAGAATATAT AAAAAAGGTC ATTCTGTAGC 360

CAACAGACAA TTTGTTGTAT ACACTTGTAA TAATAAAGAA ATAGACCATT TTCGCTTAGG 420

TATTAGTGTT TCTAAAAAAC TAGGTAATGC AGTGTTAAGA AACAAGATTA AnAGAGCAAT 480

ACGTGAAAAT TTCAAAGTAC ATAAGTCGCA TATATTGGCC AAAGATATTA TTGTAATAGC 540

AAGACAGCCA GCTAAAGATA TGACGACTTT ACAAATACAG AATAGTCTTG AGCACGTACT 600

TAAAATTGCC AAAGTTTTTA ATAAAnAGAT TAAGTAAGGn TAGGGTAGGG GAAGGAAAAC 660

ATTAACCACT CAACACATCC CGAAGTCTTA CCTCAGACAA ACGTAAGACT GACCTTAGGG 720

TTTAGATACA ATTACGAGTA TTTCAACACC AATGGGTGAA GGGGCAATTG GAATTGTTCG 840

ATTGTCTGGA CCGCAAGCCG TTGAAATTGC TGACAAATTA TATAAAGGAA AACATCTTTT 900

AAATGATGTT CCATCACATA CGATTAATTA CGGTCATATT ATTGATCCAG tCTAAAGAAG 960

TGGTTGAAGA AGTTATGGTG TCTGTGTtAA GAGCGCCAAr AACATTTACA CGCGArGATA 1020

TTATAGAgAT TAATTGTCAT GGTGGtATTk TAACGATTAA TAGAGTGCTG GGA 1073

(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:

CTTTACCAAT GCCAAATCCG AAGTAAAGTA TAGCAATAAA GATTACTAAT ACAATTCTGT 60

AAATGGCAAA TGGAATTAGT TTGATTTTGT TAATTAGATG CAAGAATGTT TTGATTGCAA 120

TTAGTCCAAC AGTAAATGCA GCTAAAAAGC CTAAAATATA AAAAGGTATA TCAGCAATCT 180

GAATATCTTG ATAATGTTTT AATAAAGATA AACCACTAGC TGCTAACATA ATTGGAACAG 240

CCATAATAAA TGTAAAGTCC GATGCTGCTT TATGATTTAA TTTCATTAAT ACCCCAGTTG 300

AAATTGTTGA GCCTGAACGG CTGAAACCAG GCCACATAGC TACTGCTTGA gAAATACCAA 360

TTACAAATGC TTGGAAATAA CTGATTTGAT CTACTGTTTG TGGGTTTTTA ACTTTAGCTG 420

AGTATTTATC AGCAATAATC ATATAGATAG CACCTACGAA TAAGCCAATC ATAACAGTTG 480

GCACACTAAA TAAATGTTCT TCGATGAAAT CATCAAATAG TAAGCCTAAA ATACCTGCTG 540

GCACCATACC CACTAATACA TGTAATAAAT TTAAACGTCT TGGCTTTGAA CGTCTTTGTT 600

GATCGTTATC TCCTTCAACA TGTTTGTGTT TACCAATATG TAAAATCTCT AAGAAGCGTT 660

CGCGGAACAC CCATGCTGCT GCAAAGACGG ATCCTAATTG GATGACGATT TTAAATGTAA 720

ATGCTGACTG AGAACCTAAA AATTCAGATG ATTTTAACCA CATATCATCA ACTAGGATCA 780

TATGTCCAGT AGAGGAAACA GGTGCAAATT CTGTTAATCC TTCGACGACC CCTAAGATAA 840

TACCTTTTaT TAATTCAATG ATAAACATAA TGTACCCACT TTCATTACTC AATTTAATTT 900

ATTTAAATAT CAAAATTACC ATATCATGAT AGCATATTCA TTTAAAGACA TGCTAGTTAT 960

AGTTATAATA CTAGACTAAA GATGTATATA TTCATTTTCT TTTACATGTA AAACTACAAT 1020

ATTTTATTGA GCTATTTAAT TTGATTTTAA GGAAAACCTT TTATAATAGG TTTAGGTGAT 1080

EP0 786 519 A2 



TTCTTGGTCA GTACTGGTCT CGGCATAATC GTTATAACGC AAAATATTTT AATAGCAGAT 1200

TTTTTAGCTA AAATTATAAG ACATCAATTT CAAGGTTTAT GGATTGTATT ATTTATTTTA 1260

TTAGGTGTTT TACTTTTAAG AGCAACTGTG CAATTTCTAA ATCAATGGTT AGGTGATACA 1320

TTAGCATTTA AAGTTAAGCA TATGCTTAGA CAGCGGGTTA TTTATAAAAA TAATGGTCAT 1380

CCAATCGGTG AACAAATGAC TATACTCACA GAAAACATTG ATGGTCTAGC ACCTTTTTAT 1440

AAGAGTTATT TGCCTCAAGT GTTCAAATCA ATGATGGTTC CGCTCATCAT AATCATTGCA 1500

ATGTTTTTCA TCCATTTCAA TACCGCATTA ATTATGTTAA TAACTGCACC ATTTATTCCT 1560

TTGTTTTATA TTATTTTCGG TTTGAAAACG CGAGATGAGT CAAAAGATCA AATGACTTAT 1620

TTGAATCAAT TTAGTCAACG GTTTTTAAAT ATTGCTAAAG GTTTAGTGAC GTTAAAGCTA 1680

TTTAATCGTA CAGAGCAAAC AGAGAAGCaT ATTTaCGACG ATAGTACTCa GTTTAGAACT 1740

TTAACAATGC GCATTTTaCG CAGTGCTTTT TTATCGGGAT TAATGCTCGA ATTTATAAGT 1800

ATGTTAGGTA TTGGATTGGT TGCATTGGAA GCAACGCTAA GCTTAGTAGT ATTTCATAAT 1860

ATTGATTTTA AAACTGCGGC AATTGCGATT ATTTTAGCGC CTGAATTTTA TAATGCAATT 1920

AAGGACTTAG GGCAAGCGTT CCATACTGGA AAACAAAGTG AAGGTGCCAG TGACGTTGTG 1980

TTTGAGTTTT TAGAACAACC GAACTATAAT AATGAATTTC TATTAAAGTA TGAGGAAAAC 2040

CAAAAGCCAT TTATTCAGTT AACAGACATA TCATTTCGAT ATGATGATTC TGATAGATTG 2100

GTATTAAATG ATTTAAATTT GGAAATATTT AAAGGTGATC AAATTGCACT TGTAGGTCCA 2160

AGCGGGGCAG GTAAATCCAC TTTGACACAT CTTATTGCAG GTGTTTATCA GCCAACAATA 2220

GGTACTATAA GTACAAACCA GCGTGATTTA AATATAGGAA TACTTAGTCA ACAGCCATAT 2280

ATTTTCAGTG CTTCTATAAA AGAGAATATT ACGATGTTTA AAGATATAGA AAATAATACT 2340

ATTGAAGAAG TGCTAGACGA AGTAGGTTTA TTAGACAAAG TGCAATCTTT CACAAAAGGC 2400

ATTAACACAA TAATAGGTGA AGGAGGCGAA ATGTTATCTG GTGGACAGAT GAGACGCATA 2460

GAACTTTGCC GTCTTTTAGT TATGAAGCCA GATCTCGTTA TATTTGATGA GCCTGCAACT 2520

GGTTTAGATA TTCAAACAGA ACACATGATT CAGAACGTTC TGTTTCAACA TTTTAAAGAT 2580

ACAACGATGA TTGTCATTGC ACATAGAGAT AATACAATTC GCCATTTACA ACGACGCTTG 2640

TATATAGAAA ATGGAAGACT GATTGCTGAT GATCGCAATA TTTCAGTAAA TATAACAGAA 2700

AATGGTGATG ACTTATGAAA ACACGACTAA AATTTCAAGT AGATAAGGAT TTATTGTTAG 2760

CTATAGTTGT TGGTGTTTGT GGAAGTTTAG TTGCGCTCGC CATGTTTTTC TTAAGTGGTT 2820

ATATGGTGAC ACAAAGTGCA CTTGGTGCGC CACTATACGC TCTGATGATT TTAGTCGTTA 2880

ATAAAGCTAC ATTTACAATG CTACGTGATA TTCGGGTACA GTTTTTCGGT AAATTAGTAA 3000

ATGTCATTCC TAATGTTTAC CGTAAACTGA GTTCTAGTGA TTTAATTTCA CGTATGATTA 3060

GTCGTGTTGA GGCATTACAA AATATAXATT TACGTGTTTA TTATCCACCA GTCGTCATCG 3120

GTTTGACAGC GCTAGTTACA GTCATAGTTT TGGCGTTCAT TTCAATCGGC CATGCG 3176



(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2109 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:

TCTTTATTTC AATTTCCAAT TGAATTTTTT TATTATTTAC GCATAGCTCT TAAAATTAAC 60

GTTACGATTG CAATTAAGAT AATTGAACCA ATTAATGCTG GCAAGATGTA AATACtTCCT 120

AATTCAGGAC CCCATTGTCC GAATAGTTTG CCACCTACCC ATGATCCAAT AATACCTGCG 180

ATAATATTGC CTAAAATACC ACCTGGGATA TCTTTACCCA TAATAGCACC AGCAGCCCAT 240

CCAATTAAGC CACCGACAAT TAACATTCCA ATAAATCCAA ACATAATTTT CAGTCTCCTT 300

TTTCTATTTA TTTTGCGTTA TTCTAAGTAG TACCCCTTAT TTACAATTCT AAAACAAATT 360

CAAATTATTT TTATCCAAAT ATTTTTAAAA GTAGTAATTG AATATCAATT TTATTCAATG 420

TAGCTATCGT TATTTAAAGT CTCTGTACCG ATAATATCAT ATACATTTAC ATTATTTTTT 480

CTGCCGAATT CATAGCTTGA TTATTTTATG TTATAGGACT AgAATATACA CATATTATTA 540

GAGCATCTTT GAATTTTAAA TCAAGAAGCG AGGTTAATGA ACAATGAATA TGCATATTTT 600

ATATaACTTA CGAACTAAAC ATAATTTAGA AATTGACGAA TTAGCACAGC AATTAAATGA 660

GAAATATGGT ACTAAATATG AAGCACATCA AATTTGGGAA TGGGAGAATC ATCACCATGA 720

ACCTAAATTT AAAGATGCCA TGCATTTAGC TGACTtCTTT GATGCACCAT ATGAAATGTT 780

TTTAGAAAGT AAGGTTAAAG AATATCAGAA ACATTTAGAA GAAGTCGATA TTCGCATGGA 840

TAAATAGATG CAAATAAACC CTCACAACAC GTTTGGCATA TATCCTTTCA AATCTATACT 900

GGATATATTA CATTACGTTG TGAGGGTATT TTATTAATTA ATATGAATTA AGACATTTTA 960

CAAGCGTTAA TGCAACGAAT CTTTTTAGTG ATCTTGCTCA CTCTTTAATA CTTTACCGTT 1020

CTTAGCATCA ACAGTAACTT CTTGTTTTTT ATTACCTTTT TTCAAATCGA TATTGTAAAC 1080

AAGTTTGCCA TCATCTTTTT CAAGTGACCA TTCTTTAATA TCACCATCAA ATTCTTTTTG 1140

ATTCATTGTA TCTTCTTTTT CAGTCTTTTT GTTAATCACT TTTTTATTTT TATCAGCAAC 1260

AAGTACTTCT GACTCTTCAC CAGATTTTTG TTGCGTCACT TTATAAGCCC ATTCACCATT 1320

AGAATTTTCA AATGAAATTC CTTTCAACTT TTGGCCTTTG TAAGTTTCTT CAGCTTTTTT 1380

CACAGCATCT TCTGGGCTTG TTTTAACATC TTTTAAAGCA ATAACATCTT TTGTTGTATT 1440

AGTGTCTTGA TTAGTATTTG ACTCTGTTGA TTTTGTTTCA TCTTTTGGAG TATCATTGCC 1500

ACATGCAGTA AGCACCACCG CTGACATTGA TAACACTGCT AATGATTTTA ATTTCATAAT 1560

ATCACTCTCT CTTCCTATTT TTGAAACTCA TAACAAAAGC TTATATGCTA TATAGATTGT 1620

ATTACCCCTT GTTTTTAATT TTATTCATAA TTATTACAAA TATTTTTAAA TTAATCGTCA 1680

TGCGTTACTT TCGTTCGTAC TCCTTTTATA AATGAACCAT GTAATATAAG CATGCTATTA 1740

TCGACTCAGT TTGTCTAATG CTTTCTTTGG TACTTCTTCC TTTTCAACTT CTTCAAAAGT 1800

TTCTACATGA TGACCTTTAT GTGTAATTTT CAAGTATCTA TGCGGTTTAA CATCAAATGT 1860

AGCAGTATAC ATTAATTGCG TCTCTTTCCC TTCTTTATTA AAAACACTTT GACTATAACT 1920

TCGGAGTTGA TCATCCATTC TAGTAGACAC CTCTGTCGTT TTAACATAAG AATCATCCTT 1980

CTTTACTAAT GGGTTAAATT GATCTGTTAT ACCATGGGAG TCTATTGTTT TTAAAATGAA 2040

TAAAGCAgCA TAAGCGCCAA TGATAGTAAG TACAAGATAt GTkATTGTTT TTAAAATCGT 2100

TTCAAAAGA 2109
(2) INFORMATION FOR SEQ ID NO: 417:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 813 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:

GTTACATAAA TGAAAACAAA AAAGATAATT TTACTGCTTA TGCTACACCA GAACATAATT 60

ATCAATTTGG TGGTGCTATG ATAGAAAGTG AAAAATTAAG CGAGTTACTA AAGCCAGCCA 120

ATCAGTTAAA ATCACCAGAT GATATAAAAA AAGAACTAAA TAAAAAGAAG AGTCACTAAA 180

GTTAGGAGTT ACTTTAATGT CCAAAAAACA TGTTTTTATA ATTATTGGTG TCATATTGTG 240

TATATGTACA GTTTCTACGG TCATGCATTT TAAAATGAAA TATGATGAAA AAGAAAAACA 300

AAAAGCGATT TACTACAAAG AACAACAAGA ACGTATTACA CTCTATCTTA AGCATAATAC 360

TAAAGAAACG AACACGATTA AATCTGTACA TTTCACAAAC TTGGAAACAA GTCCTATGGG 420

ATCGCCTGAA CATAATTATC AATTTGGTGG CGCTATGATA AAAAGTGAAG GAGTAGATAA 540

ATTATTAAAA CCAGCACATG AAAGAAAATC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 600

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCAATAA TCTATGTCAT 660

TGCTATCGAA GGTGTTTCGC AATTAATATA AATCACTTCA TCATCACCAA TACTTCCCCA 720

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAwT AaTAGTTTGG 780

CCATTATAAG AACAATATAT ATCGAnTAAC AAT 813
(2) INFORMATION FOR SEQ ID NO: 418:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 640 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

AAAAGCAATC GTTGGTGGTG CTAAATTCAT CGGCAACTCA TATGTAAAAG CTGGTCAAAA 60

TACACTTTAC AAAATGAGAT GGAATCCTGC ACATCCAGGA ACACACCAAT ATGCTACAGA 120

TGTAGATTGG GCTAACATCA ATGCTAAAAT CATCAAAGGC TACTATGATA AAATTGGCGA 180

AGTCGGCAAA TACTTCGACA TCCCACAATA TAAATAAGCA ACATGAACAT AGGATCAAAA 240

GTCATCCCCC ACTATCAATC ATGGGGGATG ACCTTTGATC CCTTTTTTAT ACATACACAA 300

GCAAAAATAG CGGTGATTGT TTACCATCAA TTTTAACAAT CACCGCTACT TTTGCTTGTA 360

ATTCATGATT CAATTTTTGT TGTGTGCACA ACGACACTAA ATTATGTGTT TGCTATTGTC 420

GTGTTACAAC GATATGCGTC GTTGATTTAA CTTATCAAGT AATTGATTTA AATTGTCTAA 480

TTCGACTTCC GATAAACATT GACATCTTGC TTCAATCAAT TCGCAACGTG CATTATTTAT 540

TTGTGAAATT AATGTACGTG CTTGATCAGT CAAAATTAAT TCTTTACATC TTAAATCTTC 600

GCTAGATTGT TGACTTGTGA TGTACCCnnT CAAAACTAAT 640
(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1159 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:

TTCTTGGATA CGTAATAAAT TACCTTTTGT CATAGGATTA AACATAGATC CAGGTAAAAC 120

ATAAACATTT CTATTTTGCT CTAATGCAAA ATCTATCGTG ATGTGACTGC CACTTTGTTC 180

CTTAGCCTCA GTAATTAAAA CCCCTTTTGA CAAACCGCTG ATAATTCTAT TGCGCTCAGG 240

AAATCTATAT TTAGCGATTG GTGTATGTGG TGGATATTCA GATATAACTA AACCTTTTTC 300

TTCTATTTTA TTTCTTAATG CTAATGTACT TTTGGGATAA TGTGTTTGAT GGCCAAAGGC 360

TAAAACTGCA ATTGTAGGGA GATTGTATTT TAAAGCTATT TGATGTGCCA TTGCATCAGC 420

TCCTTGAGCA AGGCCGGAAA CAATTGTTAA ATATTTGCTT TTATCATTTG ATAATAAAAA 480

TTCTAAAGAC TGTTGGGTAT AACTTGTAGA ATCTCTTGCA CCTACTACTG CCAAATGATG 540

CATATTATTT ATTAATTTGA TGTTCCCTTT ATAGAAAAGA AGTAATGGAA ATTGATATAT 600

TTCTTTTAAT AGCACTGGGT ATTCAGAATC CATATATGTA ATGTAACTTA CTTTTAATTT 660

CTGCAGTTCT TTAATAATAT CGTTATGATC AAGTTTTATA AAAGCATAGT ATTTACGTAA 720

TAAATGAACA TTTTCTTCCC TATTCACCCA TTCACATAAA TAACTATCTT TTTTTCTTCC 780

CTCCTCTTTA ATTACATTAG GATATGCCAT TAAAAATTGA TGAATTTGTT TAGTCGAAAA 840

GTGTGCCCAG TATAACTTAA GCAAAAATAG TTTAATCAAT AAATCAACTC CTTTTTGTAA 900

TCATACAAAA TCATATTCTA TTTTTGTTTT ACATTTCTAA TACAAAAACA TTGTCGATGT 960

AATGTTATTT TAAGGAGTAA AAATACTGAC TTAAAAAGTG AAAAGTATGT TGGAAAGAAT 1020

TTAAAATTTC AATTTCCAAC ATACTTTATA ATTAAACCTT ATAAATAAGT TTTGCAAATT 1080

TATATAGAAT TGGTCTTACT GGTTTGATGA AATCACCAAT TAATTCTTCA ACATGTGCAT 1140

TAAAACCCTT TTTAAATTG 1159

(2) INFORMATION FOR SEQ ID NO: 420:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1879 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 

TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 60

CCCCACTAAT ATTATCATGG GTATTTTAAT GTTTATAGGA TCTTCGCCAA GTTCGGTTGG 120

TGGCGGTATT CGTACAACAA CTTTCGCTAT TTTAATTTTG TTTTTAATTA ACTTTAGTAA 180

TAATGCCGAT AAAACATCCA TTAAAGTTTA CAATAGAGAA GTACACATTA TGGATATTCA 240






ATCAGCTACT GAAAATGGTA AGCTTACATT TTTAPAAPTA 111111 unnu TPATPTPTPP 1 Ln 1 \j 1 L. 1 LiL. 360

ATTTGGAACT TGTGGACTAT CGCTTGGTGT CAPAAPTPAT ATTAPTPATA TTTPTAAPPT 1 1 1 V_ 1 Jvy\j\y 1 420

CGTACTAATG ATACTCATGT TTATAGGACG TGTTGGPTTA ATATPATTTA TPATTATPAT 480

AGCAGGACGT CGAGAACCAG ATAAATTCCA TTATCCAAAA AAATAPPATA 540

ATATAATAGC AATCTAAGTT TAGTTAATGT AGATTTTAAC TPPAAPTTAP /\ 1 i *jV_ 1 1 11 1 600

TAGTTTGTAT TTTTAACTTA TTTTATAAGA CGATTGGTTT PPAAAATPPT AAAPTAPTAA 660

CAATGAGAGG TGTAACATGA TGGAAAAAAA TGAAAACATT AATPTAPAPA t"I'it a a r"T* a r 1 111 1 AL 720

GTCAGATATG CATAGTCATT TCTTAAATGG TGATTATGGT TPAAATATTT A T & P a P f'TY^/'* 780

TACTTATGTT AACCAAGTAA GAGCACAAAA tcatpgcptp A TTTT A TT 7A C r\ 1 1 1 1/\1 1/iLr A 1 A(j 1 LtLjULAj 840

AAGTTTAGCT GGCTCGTTAG CGGCCTATTA TT ATPPT A TT 1 PTTPP A rTTT oil uLHl III A I AAALOACA 900

TCCAATGATA AAGTTAATGA ACAGAATGCA TTATPATPPT APPPPTPTPA r"TT^^a a^'po tv 960

ATTCAAGTTT GGTTTATCAT TTTTAAOTCG X X i A rWV- X Vw-VJ 1 1 WjCAL 1 1 TTCCATGGTT 1020

ATCAGCAAAT ATTGAATACA ATGTTACTAA ppapppttat TTTTP A APTr* 1 A 1 1 U 1 AI 1080

TAAACATTTT GGTGACTTAA AAATTPPTAT pptappcptp Lj 1 1 1 AA I CjOA 1140

AAATGAGTAT TCTGAAATGG AGCAAGATGT ATPTATTPAA 1 LxUCATCAAA 1200

ACGTTGGATT AGATATATCC ATGAAGTTGA APAPPPAPAT 1 1 1 1 X <j>\ 1 1 Lj 1 AA 1 1 I Al LLA 1260

TGGTGGATTG AATAAAATTA GTAATAGTAC PAAAAATAAA AAPPPAAPTT LoAA 1 vjAALjL 1320

TGAAAAATTA ATGGAAGAAC TCGGTGTTAT AGATTTAATP ATTAPAPPTP A 1 L-AovJA 1 V_A 1380

AACAATAGTA GGTCAAGATC ATGAAACGTA TTATGTTCAG X * X VJ X X w nrrnriTP a a p aTnrr a a ar* a 1440

GCTTGTACAT CTTTCGATTA ATTTTAAAAA GAGAACAACA APTTATPATP x x f\ x x \3 ttp A A a r* a t 1500

TGATTCTAAA GTGATTGACT TAAATGAGTA TGAAGAGGAT CAAPAATTAT TAPATTTAap 1560

ATTCTATGAT AGAAAAGCAG TGGCTTATTG GTCACAGGAA ATPATAAPTP ATAAAPnTTT t\ 1 t\t\tV^J\3 111 1620

GATGTTATCA GTAAATGGGT TACAAGATTT AGTPTflTPAA APAPATPPAT lil ^wOV-AA I I 1680


ATTACATGAT GCAATTCACC TTGCATTTGA TAATGATATA ACATGTGTCC ACGTGCCTAT 1740

GaACGGAGAG AAGGGGTTGA GTGGACAGAT TCGAAATGAA GrTTTGTATC aTGCATACCC 1800

ATaTCCAGAT aAGCCAtGGG tATGACAATT aGTGGTCAAA ATaTCAAGGT ATnTTGGGGT 1860

ATAGTTATTC ACCATTAGG 1879


(2) INFORMATION FOR SEQ ID NO: 421: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2710 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

AATTCCCCCT TTTTCCCCAT CCATTTTnCC ATCCACTGGT CCTTACGGGA CCATATTATT 60

TnAAATTGGA nAAAATTATt TTAAAGaTTA TwACtACTCT TtAATCATTT TrGTGAATTA 120

AAAAAAGTAG TGCAAAAAGC AAAATATACT TTATACACTA CAAATCATTT ATTTATAATA 180

AAGTTTCACC AAAAAATGTT CCAACTAATG AAACCGCTTG TTCAGCAGTA TGATTATTAC 240

TGTCAATCAA TGGATTTACT TCAACTAAAT CCATTGAGGA AATTAAATGT GATTGATGCA 300

GTAATTCCAA TGCAAAATGG CTTTCTCTAT AACTAAGACC ACCCAAAACT CTAGTACCAG 360

TGCCTGGCGT TTCAAGCGGA TCTAAAGCAT CAACATCTAA AGATAAATGA ACGCCATCAA 420

CATTGCGTGA CTTCAAATAT TCTATTGTAT TTTCAATTAC TTCCTTTATC CCCAATTTAT 480

CAATATCTGA CATAGTAAAT GTTTTAATAT TATGATCTTT GATAAATTGT CTTTCACCTT 540

TATCTAAATC TCTCATACCA ATTAGTACGA TGnTTTCTGG CTTGATTACA TTACnATTTA 600

ATTCTAAAAG TTCTTTGGGA CCTTCGCCTG TCAAAATCCT TAGAGGCATA CCATGAATAT 660

TTCCACTTGG TGACTCTTCA GGTATATTTA AATCACCATG TGCATCATAC CAAATAACAC 720

CTAAATTATT ATAATGTTTA CTTATTGCTG ATACTGAACC TACCGCAATA GAATGATCTC 780

CACCAAGAAC TAGAGGAAAT CTGTTATTTT CAATTGATGC TGAAACCTCT TTATTTAATT 840

TTTGATTAAC ATCTATAATT TCATCATAAT TTCTTAATCC TTTTTGTTCA CTATGAAATT 900

TTTCAATGTT CACAGCAGGt ACCTTAATAT CCCCCTTGTC ATATACATCA AGGTCTAATT 960

GCTTTAATCT TGAAATTAAT CCAGCATATC TAATTGCTGT TGGTCCTAAA TCAACACCTA 1020

ATTTTCTTTG TCCAAATGTT GATGGTGCAC CTATAATATC AATTGCTTTT GTCTTTGTCA 1080

TAAGCGTCCC CCTTTGCTCT ATGTAATTAA AGAATAATGT ATGCGCTTAC CATTATCAAG 1140

CAATAGCTAC ACATATAATC TGTTTATCTT ATTACTTCAT AAAAAAAGGT TCTTCATCTT 1200

TTATGGTGGG AAGGTAAAAC TTCCTGCTTT TTTTAATACA CAAAAAGCGC AATTGCCTCT 1260

ATAATTTAAA GTGACCAAAC CCAAACTAAA GGAGACAAGT GCGCCTATGT GTAATGATAC 1320

CTTAGAATTA CTAAGAATAA AAGATGAAAA TATAAAATAT ATAAACCAAG AAATTGACGT 1380

CATTATCAAA GGAAAAAAAG CAACAGTGGT TAATGCTGTA CTAACGTATA AGCCTTCGGC 1440

CTGTTATTGT TGTGGAGTTA AAAATGAAGG ACAAATTCAT AAACATGGTA AGCGTGTTTC 1500

TCGTATTACT TTACTTAAAA CTCAAGGGTA TAACACATAC CTCAACTTAG CTAAACAACG 1560



GTGTTTTATC TCAAGATGTG TTACTCAAAA AGTTATAGAA GAAGCTACTA AAGTTAAAAC 1680

AGAGATTGAT ACTGCAGAAG ATAACTGTAT CTCTCCATCT ACTGTAAGTC GTATTAGAAC 1740

TAAAGCGGCT AATTCATTAC GAATTAAACC CTTTAATTGT TTGCCAGAAC ACATCGCTAT 1800

GGATGAATTT AAAAGCGTTA AAAATGTAAC TGGATCAATG AGTTTCATTT TTATAGATAA 1860

TGATACTCAT GATGTTATAG ATATTTTAGA AAATAGAACT ACAAGATTCT TGCGTGCCTA 1920

TTTCGAGCGA TTCGATTTAA AAAATCGACA ACAAGTTAAG ACGGTTACTA TTGACATGTA 1980

TGAACCCTAT GTCCGATTAT TTCGCGACCT ATTTCCTAAT GCAGCTATTA TTTTTGACAG 2040

ATTCCATATC GTTCAACATT TAAATAGAGA ACTTAATAAG TATCGTGTAC AAGTTATGAA 2100

TGAATACCGT AATAAAAAAG GACCTGATTA TACAATTTTT AAGAATAACT GGAAAGTCCT 2160

ATTGATGGAT ACTAGTAAAA CCATATTTAG TAAATACAGA TGGAATAAAT CTTTTAAGGC 2220

TTATAAACGC TCATCTGACA TTGTAGAATT CATGCTTTCA AAAGACGATA TACTACGACA 2280

CTCCTACGAA CTTGTCCAAG GATTACGAAA AGACCTAAGG TTATGTAATT GGCCTAAATT 2340

TATTAATCGT TTAAATTCAG TTAGTAAAAA GTCTGTGAGT AAGGGTGTAT GGAAAGTGGT 2400

TAAATATTAT AGAAAACATC AAAGGATGTT AAGAAATACA ATTTATTACC CAGCATTTAA 2460

TAATGGTGCT ATAGAAGGAA TTAATAATAA GATAAAATTA ATCAAGTGAA TTTCTTTTGG 2520

TTACAGAAAT TTCAACAACT TTAAAGCACG TATAATGATG aTTTTCAGCT TGTACAAAGG 2580

AGaAAAAAAG AAGACAACCA AGCCCAATAA TGGACTGGCC GCCTAATAwA nGGGskCTAA 2640

AAGTTgTATT TTTAAAAATA GTTCCTTTAA ATTATATACC CnCCACATTT GGGGGAGGAC 2700

CTAAAAAAGC 2710
(2) INFORMATION FOR SEQ ID NO: 422:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1027 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

CATTTTAATT GtTaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC CtGTGTTTTT 60

AAAAGGtACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC 120

GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 180

GCCCTCACAA GAAGATATCA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 240

GGGTTACTTT GATGAAAATG ATTGAAGTTG ATAGAATTAA TGACGTATCT TGTGTATAAT 360

ACCTACGAAG TACTTTCATT GGAGGAAAAA TAGTGACTCT ATTTATTATT ATCGGGGTTC 420

TCGTGCCAAT GGTTTATACC ATGCAGTTAA ATATTAAAAA TGAACCTGTa ACAAAGCGCA 480

ATCTTTTAAT AACATTAGCT TTATCTACGT TAGGTATTTT AGTAACCGCG TTAGCAGGTG 540

TAATCGTTAC GAAACAAGCT TTTCCTTTAT TAAGTGTAGC AATTGGCTCA ATTTTTACTG 600

GAATCGTTTG GGGCCTTTTA CTAAGTGGTA GcTACGCGCT GATACGATTT TTATCTAACG 660

CATTTGGGCG TAAGTAATAG TAATCTGTTC ATCAAGTAGT ATCCGTGCTT GAAAACAAAC 720

TAAAACTCCT AATGTGGAAC TAGTTAAAAA ATTCTAAACT AGTACATTAG GAGTTTTGTT 780

ATGCAGAATA AATTTAATTG TTAAATTGAA AGTGCGGTAA AAATCCACTA TTTATTTGAA 840

AAAAATCGAG CAACCAAATT AAACCTTGAT ATCCTAAGTA AATACATAAT AAAACAAGTC 900

CGACATAAAT TAAAAATCGC AAAATAGAAA GTCCAACTCT AAAAAGGAGG ATGACTAATA 960

GTGCTATTAA AATAATTGTt AGTATACTCA ATGCTACAAA CCTCCTArTA CGCTTTTAAA 1020

TCCATAA 1027
(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 569 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

CGGCACATTT TTAATTTATA CATATTTTAA AACTAAGTAA CAGTTTGAAG AAATCGTAGT 60

TCAATAATGT TAATTGTGAA AATGTATATA AACATAAAAA AATCATGTAT AATATATGTT 120

GTTAATTAAA CAGTTCGAAA GCGAGATGAC ATTATGGGAC GTAAATGGAA TAACATTAAA 180

GAAAAAAAGG CCCAAAAAGA TAAAAACACA AGTAGAATAT ATGCGAAATT TGGTAAGGAG 240

ATTTATGTTG CAGCAAAATC TGGTGAACCC AATCCAGAAT CTAACCAAGC TTTAAGGTTG 300

GTGCTTGAAC GCGCTAAGAC ATATTCAGTG CCGAATCATA TTATTGAAAA AGCAATAGAT 360

AAAGCTAAGG GTGCTGGAGA CGAAAACTTT GATCACCTAA GATATGAAGG ATTTGGCCCA 420

AGCGGATCAA TGCTAATTGT TGATGCGTTA ACAAATAATG TAAATCGTAC TGCCTCTGAT 480

GTGCGAGCTG CTTTTGGTAA AAACGGCGGT AATATGGGTG TATCTGGATC AGTTGCTTAT 540

ATGTTGTCAT GTGGCCACAT TTGGTATTG 569



(2) INFORMATION FOR SEQ ID NO: 424:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3264 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

TGGACCTATA AAATGGATGC ATCCAAATAT GGATGnGGGG TAnGGCGGGa AAATAGGaGG 60

TTATATAGTA AGTCATTTAA GTCGAATAGC CATCCTTTTT AAAATGAAAA AAATAGAAAG 120

CTCAATAGTT TGTTAAAGCC TTTAAATAGC GTCGTTACGC AATTTTAGAA TGCTAAAAAT 180

TGTCACAAAT TTCAAATTAC GTGCTAAAGC TTGAGATATC AATATTTATT GGCGATAGAG 240

TGTAATTTGA CTCCGCTTAC ATCTCAGTTT TATGTTTGTA ATTTGGTAGC ATAATATTAT 300

AATAAAATAA AATTGTTAAT CTTTAATTTC AGTATAGATA TTTTTACGTG TAGTCACGTG 360

TAAAATAAAT TCAATTAGGT TAGGAGACAT AATTATGAAA ACATTTGGTA AAAAGGTTGT 420

ATTAATCGGA GATGGATCTG TAGGATCAAG CTATGCCTTT GCAATGGTTA CGCAAGGTGT 480

TGCTGATGAA TTTGTAATTA TTGACATTGC AAAAGACAAA GTAAAAGCAG ATGTTCAAGA 540

TTTAAACCAT GGTACAGTCC ACAGTCCTTC ACCAGTTGAT GTGAAAGCAG GTGAATACGA 600

aGaCTGTAAA GATGCAGATT TAGTTGTTAT TACAGCTGGT GCACCTCAAA AGCCAGGTGA 660

AACACGTTTA CAATTAGTTG AAAAAAATAC TAAGATTATG AAGAGCATCG TTAAGAGTGT 720

TATGGATAGT GGCTTTGATG GATATTTCTT AATCGCGGCA AACCCTGTAG ACATTTTAAC 780

AAGATTTGTA AAAGAATATA CTGGATTACC AGCAGAGCGT GTTATCGGTT CAGGTACTGT 840

ATTGGACAGT GCACGTTTAC AATATTTAAT TAGCCAAGAA CTTGGTGTTG CACCTTCAAG 900

TGTTGACGCT AGTATTATTG GCGAGCATGG TGATACTGAA CTTGCAGTTT GGTCACAAGC 960

AAATGTAGCA GGTATTTCAG TATATGACAC ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 1020

AGCGGAAGAA ATTTATGTGA ATACACGTGA CGCTGCTTAT GAAATTATCC AAGCTAAAGG 1080

GTCAACATAC TATGGTATTG CATTAGCATT GATGCGCATT TCAAAAGCCA TTTTAAATAA 1140

TGAAAATAAT GTCTTAAATG TTTCTATACA ATTAGATGGT CAATATGGTG GTCACAAAGG 1200

CGTTTACCTA GGTGTACCAA CATTAGTTAA CCAACATGGC GCAGTTAAAA TTTATGAAAT 1260

GCCATTAAGT GCCGAAGAAC AAGCGTTGTT CGATAAATCT GTTAAAACAT TAGAAGATAC 1320

ATTTGATTCA ATTAAATATT TATTAGAAGA CTAAGCCTAT TTTAAGTATT AATTAGAAAT 1380

ATATTAATGG TAAGAGGATC TATTAGTGTT GCAGTAACAC GTGGCACTGA TAGGTCCATT 1440

TATGACTCGT GAAAATTTAG TGACATGAAT AAAAATGTTG AGGCGTCATT GAGTATAAAG 1560

GAAAGTAGTT CTGCATTAAT CACGAaGTAG AGCATGACmA CGAGGAATAA CTATAGGGaG 1620

ATGGTTTTGG AATGACGAtG TCTTGTATCA ACATGGTACA TTAGGTACGT TAATGGCTGG 1680

CTTACTAGAA GGCACAGCTA CAATTAATGA ATTGTTAGAA CATGGGAATT TAGGGATTGC 1740

AACGTTAACA GGGTCTGATG GCGAAtAATA TTTTTAGACG GAAAGGCATA TCATGCTAAC 1800

GAGCATAAAG AATTTATAGA ATTAAAAGGC GATGAGAAAG TACCGTATGC ATCGATTACT 1860

AATTTTAAAG CGAGTAAGAC ATTTCCATTG CAACAATTAT CACAAGATGA TGTATTTGCA 1920

CAAATTAAAA ATGAAATGTT AAGTGAGAAT TTATTTTCGG CTGTTAAAAT TTATGGCACA 1980

TTTAAACATA TGCATGTACG AATGATGCCT GCTCAGCAAC CGCCATATAC ACGTTTGATT 2040

GATTCAGCAC GCAGACAACC TGAGGAAAAA AGACAAGATA TTCGTGGTGC CATTGTTGGA 2100

TTTTTTACAC CAGAATTATT TCATGGCGTA GGGTCTGCTG GTTTTCATAT ACATTTTGCG 2160

GATGATGAAA GAGCTTATGG TGGACATGTT CTTGACTTTG AAGTGGATGA CGTTGTCGTT 2220

GAGATACAAA ACTTTGAAAC ATTCCAACAA CATTTCCCGG TAAAXAACGA GACGTTTGTT 2280

AAAGCGAAAA TAGACTATAA AGATGTGGCA GAAGAAATTA GAGAAGCTGA ATAGTCTAAT 2340

ATAATTAAAA GACCTTAGCG ATATTAGGAA CAGGTGGTTC TAAATGCATC GTTAAGGTCT 2400

TTTTATATTA TATATGTGCT TACATATTTT GTTGATACGC GCGTAAAAAG CTATAAATAG 2460

TATCAAGATA TGGGGTGTCA AGAATGTGTT CACGTGCACG TCGATAAATA AAGCCTTGAA 2520

TCGCTTCGAC TTCAAGAGGT TGCTGATGCG CAATGTCGTA ATACATGCTC GTTCCCATTT 2580

CGTCGGGATA ACCTTGATAT ATAGTCATAA TAGTATCGAC AGTTTGTTCT GAAAAGTTTA 2640

ACCCCTCTGC CTGTGCAACA CGACAACCAT CTAGTAATAG CTGTCTACAC AGTGTACGTA 2700

TTTCAGGATT ATGCATGATT GCAACGGTCT GTCTACCGAG TGCTGTGATA GAATTAATGC 2760

CCAAGTTAAC TAACAATTTA TACCAAATAG CTTGTTGAAT ATTTGCTTCT AATACAATGT 2820

CGATTTGACT GTCTTGGACC AAATCTCTAA ATTGTCGAGT TAATGCATTA TCTTGTATAC 2880

GTAATTGATA ATCTCTGAAG TGCGTAACAA CATCGCCTTT CTTTTGACCA CTTATATAGA 2940

CAACTGCTTG GCATACGTTT TTAAATGAAA TATGTTCGAG TTGACCATAA CCATTTTGGG 3000

CTAAAATGAT GAGCGTGTCC TCATGTGCAA GATGAGTTAA ATGAGGAATA ACTGCATCAA 3060

GTTGATGTGT TTTGACTGCA ATAATAATAA CATCAAATGT GTTTGTGACA TCTTCATAAC 3120

CTTTCACAAC AATATCTTGT GCAGGTGCAT GTGGTACAGT ATAATATGTT ATTGTTTTGG 3180

CGTGTCTCCC GATAAGCGTT GTATGAGGCA ATGATTGTTG TAATTCATAG GCAATAGTTG 3240
(2) INFORMATION FOR SEQ ID NO: 425:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 796 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 



AGAACGAACC TTTAATACGT ACGTGTCGAC CACGAACTGT CATGACAAAT TGATAAAACG 60

AATAGTTTTT CATTAGTTCA TTGTCACATC AATCACTTTT GTkTCACCTT TAATCACAGC 120

ATTTTCATCA TAAATATTAA TTGAAGCTGC TTGATCAGTG TTAGTAATTA TAATTGGTGA 180

AATTACAGAT TTAGCGTTAT TATTAATATA TTCAAGGTTG AATCTTACTA ATGGATCTCC 240

GACGTTAACT TCGTCACCAC TAGACACTAA CACTTCAAAT CCTTCACCGT CTAATTGAAC 300

TGTGTCTAAA CCGATATGAA CTAATAATTC TAATCCGTTA TCTGCTTTTA ACCCAATCGC 360

ATGCTTAGTT GGAAAGACAT TGTCAACACG TCCTGCAATT GGAGACACAA CTTCTCCTTC 420

AGTTGGATTA ATACCAAAAC CTTCGCCCAT CATTTTTTGT GCGAATACAG GATCTGGAAT 480

ATCTTCAATT TTCACGAATT CTCCAGTTAA TGGTGCATAA ATTGCGATAT CTTTCTGAAC 540

TTCTTTGCCT TTTCCGAATA ATTTTTTAAA CATACTTTCC ACTCCTACTT ATCAAAATGT 600

GATATTAAAT CGCCATAACC CAATTCTTCT AACTTTTCAT ATGGAATAAA TTGAATTGCA 660

GCGGAATTGA TACAGTATCT TAAGCCGCCA CTTTCTTTAG GTCCATCATT AAAGACATGT 720

CCTAAATGAC TATTTGATTC TTCTGAACGC ACTTCAGTTC TCAACATACC AAATGATTTG 780

TCGACTAATT CTATAA 796



(2) INFORMATION FOR SEQ ID NO: 426: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3393 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:

ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATTAGCTT 60

TGATAACTTT TCAGCCGTAT CCAACATTTC CGATTGTGTT GTTTTTCGAC CCCTAGTATA 120

TGTAATAGCC ATTTTAGAAG CATTATCAAC TAAAACTTTC CCATTTCTGT CTAAAATACG 180

ATGTGAGCCT TGTGCGATTT GTAAATAACC TAAACGTAGT ACTAGTACTG CAAAAATAAA 300

TACAATCACA CCAAATATAA AGTTAATTCT CTTGTTAATT GTATTTTGAA CGATTTCATC 360

ATTTGATTTT TCTTTTAGTC TTTTTAACAA AACTACCTAC CTCTATTCAA AGTCTTTCAC 420

TTTAAATCAT ATATGAATTT AGAAATTATT TCTATCTTTT TGACAAAAAA ATAACGGTCT 480

CATTtAAGAG ACCGAACAAG TAATCATACT TTATTTTGTT GCATTATATA ATTCGTCAAC 540

TTTTTCCCAG TTAACTACAT TCCAAAATGC GCCAATGTAG TCAGGGCGTT TGTTTTGATA 600

TTTTAGGTAA TAAGCGTGTT CCCATACGTC TAAACCTAAA ATAGGTGTTT TACCCTCAGT 660

TAATGGATTA TCTTGGTTTG GTGTAGTCAC AATTTCTAAC TGGCCATTGT TTACGACTAA 720

CCAAGCCCAA CCTGAACCAA AGCGTGCAGC TGCTTTGTCA GCAAATTCTT TTTTAAATTC 780

TTCTAAAGAA CCCCATTGTT CTTTAATTTT TTCTACTACA GTACCTTTTT CTTCTGAGTT 840

TGGTGAAAGT AAGTGCCAGA ATAATGAATG GTTTAAATGT CCACCGCCAT TATTACGTAC 900

AGCAGTTTGG ATGTTAGCTG GTACACTGTC TAAATTAGCA ACAATTTCTT CAATAGATTT 960

AGATTCTAAA TCTGTACCTT CTACTGCAGC ATTTAATTTC GTAACATACG TGTTATGATG 1020

TCTGTCATGG TGAATTTCCA TAGTTTCTTT GTCAAAATGT GGTTCTAATG CATCAAATGC 1080

GTATGGTAAT TTTGGTAATT CAAAAGCCAT AAATAATCAT CCTCGTAAAA TGTCTGTAAG 1140

TAAATAATAA CAAGCAGTGA CTGGTTCAAC AAAGAATTTG CTTAAATTCT ACTACTTATT 1200

ATTTTCTCTA CTCATTTAAT ATAACTCAAA TCAAAAATAA TTAAACATTT TGTATATAAA 1260

AAGTTAACAG ATTTGCCATA AAATCATACG AACGGAGTAT GAAATGAACC TTTATCTTCT 1320

ATAATTTAAA AAATGAGATT TATGCATACA TCGGACCAAA TGTGCATAAA TCTCATTTCT 1380

TTATATTAAT CTTGGcAAGA CTCACATACA CCATAAACTT CAAGTTTGTG TTTGTGAATA 1440

TTAACACCAG GTAGTGATAA TTTTATCTGA TCTATTGGAC AATAATCTAT TACCTTTGTA 1500

TCTCCACACT TTTCACAGAT AAAATGATGA TGATGATGGT TTGTACAAGC GATTCTAAAC 1560

TTCATTTCAC CATCAAGTTC TGTATTTTCA ATAATTCCTA AATCTTTAAA TAAGTGCAGG 1620

TTTCTATATA TTGTGTCGAA TGAAATTCCA GGATAATTTT CATCCATAAC TTGTTGTATA 1680

TACTTTGCGT TTATATACTT ATCTTCTTCG ACAAAAATAT CTAACATATC TTTACGTTTA 1740

TCTGTATATT TTAAACCGTT CTCTTTTAAA ATTTTAATAG CATCATTTGT ATTCATTGAT 1800

ATTAGCTCCC TTTTTAAACT TCATTCGCAT TTTCTGATAA GCCATTGTAA TCATAAGTAA 1860

AATAACAAGT AGAACTACAA TTACACCACC CGGAGAAATG TCCATATAGA AAGCTAGGAC 1920

TAAGCCTAAT ATTACTGATA ATTCACCTAA AAATACACTT AGTAATATCA ATTGCTTAAA 1980



TATCCCTACA ACACGCATTG AGGCAGAAAT AACCATCGCT ACAATAACAA TAAATAAAAA 2100

TTGAATCCAT TTAGGAATGC CAATGACTTT ACTATATTCC TCATCAAATG ACAATATAAA 2160

TAATTCTTTA TAAAACAATG TAATAAACAG AACAACTATG ATGGCAATGA CAATAATCGT 2220

TGTTAAATCA CTTATATTCA CTGCGCTTAT TGAGCCAAAT AGCAATCCAA CAATTTCTTG 2280

ATTGAACCCA TCAGCTAATG AAATGAAGAT TGCACTCAAG GCGATACCAG CACTCATTAT 2340

AATTGGAATA GCAATTTCTT GGTAAGCAGT GTATGACGTT CTTAATTTTT CAATTAGAAG 2400

CGCACCTACT ATTGCGAATA AGATTCCAAA CCACATTGGA TTAATAAATA CTAGTGTTGG 2460

CATAATAGTA AGTAAAAACA TACCGAAAGA TATACCACCT AAAGTTACAT GACTTAGAGC 2520

ATCAGCTwTA AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG GCGCAATAAA 2580

ACCTAtCAAG ATAcCACTAA TTAAAGAGTA CCTCATAAAA TCAAAATTCA ATAATGCATC 2640

TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTTTATTA ATTACAACAT 2700

TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT TGAAATTTCA 2760

ACTTCATCAA GTGATTTAAA CTCATCAGTT GTACCATGGA AATGCAAATG CTTATTTAAA 2820

CATGCTACTT CAGTAGCAGT ATCTGCTACA ACACCGATAT CATGAGTAAC TAAGATAATG 2880

GTGATACCTT CTTGTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 2940

TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTGcAG GTTCTGAAAT CAATGCTCGA 3000

GCAATCATTa CACGTTGTTG TTGACCACCT GATAATTCTG CTATATTTTT ATGAATTAAA 3060

TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACTTTTT CATTATCTTT GCTATTAAAT 3120

GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA CAACTTCTTT AACACTTGCT 3180

GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT AATTGATGTT 3240

TcTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA GGTAATAACC 3300

CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA AwTGCtAAAA 3360

ATTCACCTTT ATTTATTTTG AATGnnnTAT ATT 3393
(2) INFORMATION FOR SEQ ID NO: 427:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1123 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:

CCCTGTTTCT TAGCCTTTTC GTATATGCGT TCATAAGAAG AATTCTTTTT GATTTCAGAT 120

TGTAAAGCAC TGTTTTCACT TGATTGTTTA TCTATTTTAT AATCTAAATC TGCAATCTTT 180

CCTCGCGTAT CATACGCATC CATTTTTAAA GATAGCATAT AAATACTTAA CATAGCAATT 240

ACAGTAATCA AAGTTATGTA TAAAACTTTT TCAAATTTAG TTAATTGTAC AACCACTTTT 300

CTCGAAACAG TCTTCTTTTC GGGCTTAGTT TGTGGTTGTT GCTTCGGTAT ACTATTATAA 360

ACTTGTTCGT CATATGGTTG GTACACTTTT TCTACAGCCA TTATAAATTG CTCCTTATTT 420

AAGTATTTCA GCTACACGTA ATTTCGCGCT TCGTGCTCTG TTATTGTCAT CTAAATCTTC 480

TTCTGTAGCG GTAATCGGTT TACGATTAAC ACGCTTTAAC TTAGGTGTAT ATGCTTCTGG 540

TATAACTGGT AATCCTCTTG GTACCTCTGG ACCTTTTTCA TATTCTTGGA ACACCTGTTT 600

ACATAAACGA TCTTCTAAAG AATGGAAAGT GATTACCGAA ATCCTGCCAT CTACTTTCAC 660

TAATTCAATC GCTTGTTCTA TTGAATCTTC AAAAGCTGAC AATTCATCGT TTACTGCAAT 720

TCGTAGTGCT TGAAATACTC GTTTTGCAGG ATGTCCGCCT TTTCTTCTTG CTTTTGCAGG 780

AATACCTTCT TTTATAATGT CAACTAATTC TAATGTTGTT GTTATTGGTT GTTGTTCGCG 840

ATGTGCTTCG ATTCTTCGAG CTATCTGTTT TGAAAATTTC TCCTCGCCAT AGCGATAAAA 900

AATCTTCACT AACGCTTCAT ATGACCAATT GTTAACAATT TCATATGCTG TTAGTTCTTG 960

TGTTTGGTCC ATACGCATGT CTAATGTTGC GTCATGGTGA TAACTGAATC CTCGTTCTGG 1020

AATGTCGAGT TGTGGGCTTG AAACACCCAA GTCGTAATAA ATTCCATCTA CTTTTTCAAT 1080

GTTTAAGTCT TTTAATATTT GAGTTAATTC ACGGAAGTTG CTA 1123
(2) INFORMATION FOR SEQ ID NO: 428: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 838 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
TATATCCTCC AnACCATAAA AATGGAATTA TAGTGAAGAC CGCTATGATC CAGGATGACC 60

ACGTATTATA GAATCTTTCg ATAGAGAAAC GAATCaACGT ATCCgACATC aCGAAATGAA 120

ATTAGAAGAT TATAAAGATG AGTTAAGAAG AGAATATCTA AAACAATCTG ACAGAATTGA 180

AGGAGATGAA TAAGCGTGGG ACTTGATTTT AGTGGTTTAC CAGATTTAGC AGTATTGGAA 240

CAAATGAAGG AAAAAGAACA GATTAGTGAG GTTATTGCGC CTGAACATGT TCGTATGCAT 300

CATTTCAAAA AATTTGAAGA TGATTTTAAA AATGCGGCAC AAGGGGCTTG GGTGAAAAAT 420

GCCACAGACG AATTAAAAGA TATTAGTAAT GATTTAGAAA AAATTCAAGA TATTAAAGTA 480

TAAAAAGGTA TTAAGAAACA CAATAAGTAT ATAATCCATT TAATAATAAA TGAATTATAT 540

AGTTCATAAT TTCGACTATA AGTGGCTATT AGCCATTATT TTTCGGGATC TATGTCAAAT 600

CGGACTAATG AATTCAATAA TGGAAGTTAA GCAACCAATC TTTGTTTAAC TTCTTTTTTA 660

TTTTTTGGAA AATAAAGTTT TGAACATAAT AAAATTTGAT TATGTTTTAA CGAATTTTGA 720

TGTTTCTTGA ACTATATATC ATCTAGTCGT CATTTACAAT TGGTAAATAT GACTTCAAAC 780

TGkATGAAGG TGATGGCGAT TAAAAGGCTC ATCCGTAGGT TCTAAAGAAC TAGAnTTT 838
(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 

nGTGTTCAAA TAATAGAATG GCTAATACCC CAACAATAAA TAGGAATAAA TATGTTGATG 60

TCGTAATTTT AGTAAATAAT GAAAATCTAA AATTAGTAAC CCTATTTTGA ATATAAGCTT 120

TAACTTCTAA TAATACTGGG AAGCCGATTG ATCCTAAAAT TATTAAAAAT ATAACTATCG 180

CTTGTACAAA ATAATCATGT GCATAAGGAA TTAATGACTT ACCTGTAATA TCTAATCCAC 240

CATTGGTAGT GGCAGAAATA GAAACAAAAA CACCTTGCAT GATTGCATAT TTTAAATCTG 300

GATTATCTCG ATAAAAGTAA AATGCTAATA ACATGGCTCC TACGAGTTCG ATTACAAATA 360

TTGATTTTAC AATATCAATA ATCAATTTGA CGGTACCACT CATTGTGTTT TTGTTATTAT 420

CTAACATAAT TAACTGACGT TCTCTAATTC CAATATGTTT ACCTAGTACC ACCCATAACA 480

TCGTACCAAT TGCCATGACG CCAATTCCAC CAATATTTAA TATCACGAGG ATAATTAATT 540

GTCCAAATGT AGAATAGGTA TCGACAATAC TTATCGGAGA CAATCCAGTA ACACTAATTC 600

CGGATACGGC AACAAATAAT GTGTCAATTG GATTTACTTC TACACCTGGT TTATGaACAT 660

ACGGTAAGTT TAATAATAAA AATGCAATGA CAATTGCGAC GATATAGTAC AATACAATAC 720

CTTGTTGAGG GCTTGATCTT TTTAAAAACT GGCTAAAAAT TGACACGTTG TTCACCTCAA 780

CATTATTTTA GTTTAATATT AATTTCTTTC TCTTTACCAT CTCTATAAAT CTTCGCTGTA 840

ATTGATTTCA AGTCATCTTT ATGACTAAAT ATAATCTGCC TAAAGCGTAA ATCATCTTCT 900

AAACCATTGT TGTCAACTTG ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA 1020

ACTGCTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 1080

GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTtGTAC TTCATTAaCA 1140

GGtATTGCAA 1150



(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 

AAGAATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60

AATATTTAAC GATATTTAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 120

TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGGCATTC GTCACATTAA 180

ATGATGGCAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 240

AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300

AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360

CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 420

TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 480

AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 540

AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600

ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660

TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720

ACAAAAAGCA GTATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nGCGGAAAAA 780

TAATAACnTT TTGGATn 797
(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431:

TTTCCAAGAG AAGTTGCGGA AGTAATTAAT AAAACGCATC ATAATAAATT GGTCATTTCG 60

ATGATTTCGk CACAAATCGA TGCGGATAGA ATGGATTATT TACAACGTGA TGCGTATTTC 120

ACAGGTGTAT CATATGGTGC TTTTGATATG GAGCGTATTT TAAGATTAAT GCGACCTTCT 180

AAAGATGAAG TACTAATCAA AGAAAGTGGT ATGCATGCAG TTGAAAACTT TATTATGAGT 240

CGTTATCAAA TGTATTGGCA AATTTACTTC CACCCAGTTA GTCGTGGTGG AGAAGTGCTG 300

CTTAATAATT GtTTGAAACG CGCAAAACAG CTTTATAATG AAGGCTATGA ATTTAAGTTG 360

CATCCACATC ATTTTATTCC ATTTTTTGAA GAGACAGTTA CGATTGAACA ATATGTTGAA 420

CTCGATGAAG CGGTAGTTAC GTATTATTTG GAAAAATGGA CAAAAGAAGA TGATGCTATT 480

TTAAGTGATT TAGCAAGTCG ATTTATTAAT CGAGACTTAT TTAAATATAT TCCATTTGAT 540

GGCTCAATTA TTACAATATC AGAACTGCAA GAACTGTTTG AAGCAGGTGG TATTAATCCA 600

GATTATTATT TTGTGAGTGA AGCATTTTCT GATTTGCCAT ATGACTATGA TCGACCGGGG 660

TCAAATCGCA AACCGATTCA TTTATTAAGA CAAGATGGTA CGATTAGAGA AATAAGCAAT 720

CAATCATTAG TCATTCATAG TATTACAGGC ATTAATCGCC AAGACTATAA ATTATATTAT 780

CCTAGAGAAA TGGTTGCAAA GATTAAAGAT AAGACAATTA GAGAAGCTAT TGAAAATTTG 840

ATTAATGAGC TTAATTAAAC AGGGCTAAAA TTGTTATCGT TAAATATGGA GGTTATATCA 900

TTGTCTGAGA AAAAAGGCTT TAATTTTAAT ATCATAAAAA ATGACCCTCT AGATGGTCAT 960

AAAGGTACAA ATATTGGTTC AATTAGCTTA GACAATATTG CACCAGTTTT TATCGATGTT 1020

GCTAACAAAG AAGCATTTAT TGATATTGGA GGCATGCATG CTCGTGCCAA AGTTGAAAAA 1080

GGTGTGAAAT GGATTACTGA TAAAGCTGCT GTTGAAGGCG ATGAAGCTAA AGaATATTGG 1140

TTGTGTTGGG TAACAACAGA ACGTAATGAA CAAGGACCAT ATTACGCTGG TTTAACAGCG 1200

TGCTATTTAT TAGTGAATAA AGCAATTCGT CGTGGTTATA AAAGTATGCC TGAACATGTT 1260

AATATGATGG ATAAATCAAT GAAACATCAT ATTATCATAG ATCAAATTGG TGACGAGAAT 1320

AAAGCTATTT TAAAAGACTT TTTAATGAAC CATGATGAAG GTATGTGGAA GCATTCTTCT 1380

GATGCTTTAC ATCAAGCATT TAATTAAATA TTAGAAACTA AAATTTCCCA ATTAATCTAT 1440

AAAGATATGA TTCATTTCTC AATGAC 1466
(2) INFORMATION FOR SEQ ID NO: 432: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432:





TTGTACGAAT ACAAAGTTGC ATTAATATAG AATCATTTAT GCTTTTTAAT ATCACAAAAC 60

AAATTGATTA AACTAGCTAA AATAGTCAAA ATTGGAATCA AGATTACATA AATATCATAA 120

CCTCTAGATA TTAGTATACA TAATATAATT AGCGACCCTA TTATAATTGC TAAATTATAA 180

GCCAAACTAA TTTTATATTT CATTTCAATT CCCACCTTAA TAGCATTTAT TAAAATTCTA 240

TAATGTTTAC ATAGACTTCT TAACAAACTT TAACCCAGCT AAGCTAAGTA TTGAAATAAT 300

TACACCTCCT AATATAAATA TATTACTTAA GCTTAGTAAA GGTATAATTA TACTTATCAG 360

TCCTAAAGAC AATGTATCCG CTGcATAATT CGaTGtAGAT GAGATACTAA ATACTTTCCC 420

CATCAAATGA TTTGGCGTTT TTATTTGAAT GGCAACTGAT CTTGTTAGTC CCTCTATAGA 480

TTGTCCAAGT CCCAATAATG TTGCACCTAT ATATAATATC GCCACACTTG GAAACACATT 540

AATAATCGAT AAGCCAATTC CCCAAACTAA AACACCAATA CTAAATTTAA AGATTAATCG 600

CTTTTCTGAC AGCAAACCCA TAATCAATGA CATTAATAAA GATGCTATAC CTAAACATGA 660

TGTAGCTAGT CCATATACAC CAACGCCCTC TTTTAATATA TTGGAAATAA ACAATGGTAA 720

TACAACACGC CAAAGGCCAG TATTAATCAA TATGCAGGCA AATTGGATGA TTATAATAAA 780

TGGAATTTCT TTAGATTGTT TCAAGAATTC CCAAGTTTCA GAAAAATCTT CTTTTGAGTG 840

TCTATCAATC ATGTTGTTAT TTGTATATTT TAAAAGTGCA TTAAAAATAA ATCCTAAAAA 900

TAGCAATATA CTACAAATAA AAAAGACGCC AACATTACCA ACTAGTATTA CAATGACACC 960

AATTAAAGCA GGTAAAATAA TATTTGAGCC TCTTTGCAAA CTATCGATTA ACGCATTACT 1020

TGTTGCTAAA TGCTCCTCAT CAATAATTTC AGGAAGAATT GCCCTAAACG CAGGATCCGT 1080

ATAGCAGTTA ATAATGGTGA TAGCTGTAGA TATGGTTAGA AGCGTCAGAT AATTTAAATT 1140

TGATGTTATT GCAAGTAAAG GAATAATTAT TATAATCAAA CTTAGTATAA GATCAGATAG 1200

ATAAAGTATT TTCTTTTTAT TATGTTTATC AGAATATGCG CCACCGAAAA TACCAAATAT 1260

AATAAATGGA AGTGTTTGAC TCATAACCAT CATTGATAAT TTTAAAGATG ATTGGTTTGT 1320

CAATTCAACA GTAAACCAAA TTATTTGTAA CGAAAACAGC ACAAAACAAC TCCGACGTAA 1380

GArATTACCA ATCAATAAAT ATGTAAAGTT TCTATTTTTC AAAACTTCTA AATACAACAT 1440

ATTTATCACC TCTCATAAAA ATAATTGAAT GCATCCACCA GCTTTTTTAG ACCTTCTTCT 1500

AAACTCTCTT TATCCAAAGC GCAATTAATT CTAATATAAT TTAGTCAGTT AAATATCAAT 1560

TATTTCGAAA TATACATACT ACTTGAAACA CCATACATAA CCCCCAAAAT GACTACTCAG 1620

AGGTTATATT CTACTAATTA TGATTATATT AAATATGAAA ATATTATCAA AAAAATCAAA 1680

TCATTCTCAT CATATTCTAG GTTGTTTTTA ACAAACTAAA TATAGTGAAT GCAAATCAAC 1800

TATTATTTAA ATTATGAATT ATTTTAATTC TTTCTTCTAC GAGCCAATAA CATTAATCCA 1860

GCAATTCCAA TTATACTACT AAAGATCAAA CCTTTTTGCG TGCTTTCTAA ACCTGTTTTT 1920

GGTAATTCTG CTCGTTTTTT CTCTTGATTA GCTACTGATT CTTTAGCAAT TTTAGATTTT 1980

TTAACTTTAT CATTTTTATC CATTGAATGA ACTGGGCCAT TTGGTTTTGC TCTGTCTTTC 2040

GATAATCCTG GATTGTTAGG ATTTACTGGG CCACTTGGAT GAGTTGGTCT GCTCGGCTTC 2100

TCTGGGTTTT CAGGTCCTTT TGGATCTTTT GGTTTCTCTC CACCGAACTC TACAATCTTA 2160

TCTACTGGTT GTTTtGTGAT CTCTTCTGTT GGTTGACCCT CGCCAACTTT TTCACCTGTT 2220

AATGGGkTCA CTGTGAwTGG TGkTGTGAwT GTCyTACTTC CTGGkTGTCC TTCTTGTTTC 2280

ACTCGCTCTT CACCAGGTTG TAAT 2304
(2) INFORMATION FOR SEQ ID NO: 433:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

30 

GTCTTAATAT TTTACAAGAT ACAGCCTGAC AATATTTATA TTACAAACCA ATATGGTAAA 6 0 

GAGGCTGTGT CI' CG ATT AAA AAGAATTTTA GAGCAAAGTA TACCTAATAA AGAATATATT 120 

35 GATTTATCAG AGATATCATT AATTACATTT TTTAAACCAG AATATAATAA ACAACATGTG 180 

AATGAGGAGT TTAATTCTTT ATCTTCTTCG AAAATAAAGA ATTTAGCCAA AGAAAATGAT 24 0 

GGTATTATTA TTTATTTAAA TTTTAAGGAT GTTAATTATC AACTGaTAaG TGAAGGCAAT 3 00 

40 ACTTTTTTTT CTGCAAAACC ATATTTGCAT TGTGTGTTCA ATAAAGATAT TTTAAATATG 360 

GATAAAGTTG ATATAGAAAA TTTTTTCAAA AGTATAGAAG TCAAATACAG TAAATAAGAT 4 20 

CAATTATTGA ATCCACCGAC CAGCAGATTT TTTTAAAAAA TTAATACCCC GTTAATACCC 4 80 

45 

TTTGCTTCAA TTTGATGAAA ATCAATGAAA TTCAAAATTG AAGAAATCCT TTAATATCAA 54 0 

GGTGTACGAC AGTCTATATA ATCATGCGAA ATTCTAAAAT TTTCtGAcGT AAAAAAACAA 6 00 

ATTCyTAAAG CAGCTCGTCG TTCACCTCAA TTCTCAAAAC GTTAATTGTC GGACGATATA 660

TATACAAAAC ACCTCGATGT TATGTCGAGG TGTTTTTTTG CGTTTGTGTG GGGAATATGG 720 

AATAGAGTAT AAATGAATTA ATGTCTCAGG T AT AG AA TT A ATTCAACTAT GAATTATTGA 780 
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TTAGCTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATTCAAGGG AAAG AG CT AT 90 0 

CGGATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC AAAATAAGGA 96 0 

GTGACTTAAT TG AAAG ATT C TATATTTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 102 0 

GTTGsGATGC TGGTGTTTCT ACTTTTTAGG TTTTATATTA AaACAGATAA CTTTTCTATA 10 8 0 

TATTTAATGA GTATCTTCTT AATTTG TTT A GGAACTGCTT CTATCATTTA TAACTATAAA 114 0 

ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 1200 

GGCTTTTTAT ATCTCCATAA AACGAGAAAT TCAAGCCCGA CATAAAACAG CATTTTCAGC 1260 

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 13 2 0 

TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACGCA CAAAATCTAT 1380 

TCATGCATTT TTGGAATACT TAGTATTACA AATAACGATT TTT ATT CATC TAACAAAGGT 14 4 0 

TATATAATGT ACTGAAGGCA ATTTTTATGT ATTACAAATC TAATCGTACA TGTAAAATTT 1500 

TGATAAACAT CATTAATTTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 156 0 

TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 162 0 

TACACTGCAG CCGATATGGT TATTGATACT TTGAAAAATA ATGGGGTAGA ATATGTTTTT 1680 

GGTATTC CGG GTGCAAAGAT TGACTATCTA TTTAATGCTT TAATTGATGA TGGTCCTGAA 174 0 

CTTATTGTCA CXCGXCAXGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 1800 

ACAGGCAAAC CGGGTGTAGT ACTTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 186 0 

GG ATT ATTAA CCGCAACATC TGAAGGGGAT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 1920 

CGnAnATGAT TTATTACGAT TAACGCATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 198 0 

TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATGACAAA 2 04 0 

TGCAATTCGA ATTGCTACTT CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 2100 

40 CGTTATTTCT TCACCAGTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 2160 

AGTACCGAGT GAACAAGATA TTAATGATGT CATTGAAG CG ATTAAAAATG CATCATTTCC 2220 

TGTTTTATTA GCTGGTATGA GAAGTTCAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 2280 

AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 234 0 

TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 24 0 0 

ATTATTACGT AAAAGTGATT TAGTTGTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 24 6 0 

TAGTAACTGG AATAAAGAAT TAG AAA CAC A AATTATCAAT ATTGACGAGT TCAAGCTGAA 2 52 0 

ATTACTAATT ATATGCAACC GAAAAAAGAG TTGATTGGTA ATATTGCTAA AACGATTGAA 2 58 0 
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CAATTAAGAA CACATATTGA TGAAGAAACT GGTATTAAAG CGACGCATGA AGAAGGAATT 2700 

CTACATCCAG TGGAAATTAT TGAATCTATG CAAAAGGTAT TAACTGATGA TACTACTGTA 276 0 

ACAGTTGATG TTGGAAGTCA CTATATTTGG ATGGCACGTA ATTTCAGAAG TTACAATCCA 282 0 

AGACATTTAT TATTTAGCAA TGGTATGCAA ACGCTTGGTG TAG CATTAC C GTGGGCAATT 288 0 

TCAGCTGGAC TTGTGCGCCC TAATACGCAA GTTGTGTCCG TTGCTGGCGA TGGTGGCTTT 2 94 0 

TTATTTTCAT CACAAGATTT AGAAACGGCC GTACGTAAAA ATTTAAATAT CATCCAGCTT 3000 

ATTTGGAATG ATGGAAAATA TAACATGGTT GAATTCCAAG AAGAAATGAA ATATAAACGT 3060 

TCGTCAGGTG TAGACTTCGG TCCTGTAGAT TTTGTAAAAT ATGCAGAATC ATTTGGCGCG 3120 

AAAGGTTTAC GAGTTACTAA TCAAGAAGAA TTAGAAGCGG CAATTAAAGA GGGCTATGAA 3180 

ACAGATGGTC CAGTATTAAT TGATATACCT GTAAATTACA AAGATAATAT CAAACTTTCA 324 0 

20 ACAAATATGT TACCTGACGT ATTTAACTAA AATAAAGATA AATGTTAAAG AGGAGTGGGA 3 3 00 

GATTTTATGA CTAATGTTTT ATACCAACAT GGTACATTAG GCACATTAAT GGCAGGATTA 33 60 

TTAAAAGGAA CTGCATCAAT AAATGAATTA TTGCAACATG GTGACTTAGG TATCGCTACA 342 0 

25 CTAACAGGTT CAAACGGTGA GGTAATCTTT TTAGATGGAA AAGCTTACCA TGCAAATGAA 34 8 0 

CATAAAGAAT TTGTAGAATT AAAAGGTGAC GAGTTAACAC CATATGCAAC TGTAACTAAA 3 54 0 

TTTGTAGCAG ATACAAGCTA TGAAACGAAA GATAAATCTT CAGAAGCAGT TTTTGCAGAA 3 60 0 

30 

ATTAAGGAAA AGATGTTGAG TGAAAATTTA TTTTCAGCAG TAAAAATTTC AGGCTTATTT 3 66 0 

AAAAAAATGC ATGTACGTAT GATGCCGGCT CAAGAACCAC CTTATACACG TTTAAT CG AT 3 72 0 

35 TCAGCTAGAA GACAACCTGA ACAAACTGAA ACGTATGTCA AAGGTTCAGT CGTTGGTTTC 37 80 

TTTACACCAG AATTATTCCA TGGTATCGGA TCAGCAGGAT TTCATGTACA CTTTGCGAAT 3 84 0 

GATGATCGTA ACTTTGGTGG ACATGTCTTA GATTTTGAAG TAGAAGATGT TAAAGTAGAA 3 900 

40 ATCC AAAAT A T AG AAA C ATT TGAACAG CAT TTTCCAATTC AAGATAAAGA TTTCACTAAA 3 96 0 

GCAAATATTG ACTATAAAGA TATTGCAGAC GAAATTAGAG AAGCTGAATA ATGAATCCAG 402 0 

AAATATAATG ACGGTTTATG AAAATTGACT TCATAATGCG CGATTTAGAA ATGATAGTTT 4080 

45 

GTAAATATGA TTAACCATGA CTACAATAGA ACAAATATAT TTATAATTAC GTCTAAGTAA 414 0 

TAAAATAAAT CCCTTCACTA TT AG CAGTAG TGAGGGGATT TATTAGGTTC CAGATATTTG 4 20 0 

AGATTTGCTG TTATGTTTAG ATTATAAATT GTGGTACACA CTCATATAAA ATTTACTATT 4 26 0 

GTATAGGCCA ATCTGTTACT ACGAGAAGCA AACAACAATA ATTTACAAGT TCAATAACTA 4 32 0 

AAAAGACAAA CGCCAATTTT TCAGCGCTTG CCCTATAAAA CTATTTTCAA ATTATTATTT 43 80 
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TTTCTTTAGA TTCACTACTT TTTTATTACC ATCATTCAAA GTAAGCGTAT AAGTTGCTGT 4 50 0 

TTGGGCATTA TTAATTTTTT CTGTTGTAAC ACCACGTTGA GAAGCTAATT CA1TTTTTAC 4 560 

TTTACTGTCA ATTTCTTGAT AAAGAACATT TTTATTTTCT GGG AAGATAA AGTAAGTTCG 4 62 0 

ATGTAATGCA GTAATACCAT CTACTGAAAT TGTGTAAGGA ACAGTGTGAT AACCATCCAC 4 6 80 

AGTCATTTCT TTATAGCCGT TATTACTATC TGCAGATGCT TCGTGACTCG GTA 4 73 3 
(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:

AATAAATGGT AATTTTATAT CATCAACGGC TACAAATGGG GCAGTTTCAT ATATAAATAA 60 

ACACATCTAC TCAACTGnTC TTAATTAATT AATAAATGAT TTACCAGATG ACATATAACA 120 

25 

GCCATTATTA GnTTAATTCT AATAGTTTAT TTAAATTTTC TTCGGTTGTC GCCCAACTGG 180 

TTGCGAATCT AACAACACGA TGTTGATCAT CGTATTTTTC CCAAACAGCA AATTTAACTT 24 0 

TTTGTTCTAA CTCTGCTATT TTCTCGTTAC TTAAAATAAA AAATTGTTGA TTGGTTGGAG 300

AATCAAAGTA AAGACGATAG CCTTTATTTT TAAACCCGTC TTTCATCTTA TTTGCCATTT 36 0 

CGATAGCATG TCTGCTTATA TTAAAATATA AATTGTCCGT AAATAATTCT AAAAATTGTA 420 

35 TGCCTGTTAA CCGTCCTTTT GCTAAAAGGG CACCGTGGAT GCTTGATTCG AGTGGTAAAT 480 

TGTTTCGGTT CATTATTTTT CGTAAAAACA ATGGCTTCCC CGCATAATGC ACCTATCTkC 54 0 

GTACCACCTA TATAAAATAC ATCACAATAT TTAGCGrTGt CTTTAATAGT CATATCTGAT 600 

40 

TGGTCACTCA TCAATCCATA CCCTAATCGT GCACCATCCA TAAATAATGG AAGCTGATAT 660 

TGCTTACATA CTTTGCATAA CTCTTCCAAT TCTGaTTTAG AGTATAATGT GCCATATTCT 720 

GTAGGATGAG AAATATATAC CATTCCTGGG AATACCATAT GGTCCTTTTT AAAATCACTT 780 

45 

TTAAATGTCT CCATGTAAGT TTCAACATCT GAAGCACTAA CTTTTCCTTC CTTAGAGGGT 84 0 

ATAGTAATTA CTTTATGTCC ACTATATTCA ATTGCACCGC CCTCATGCAC AG CAACATGA 900 

50 CCAGTGTCTG CTGAAATGAC CCCTTCGTAA CTTTCTAACA TTGAATTAAT AACAACCTGA 960 

TTGGTTTGTG TTCCACCTaC TAAAAAACGA ATTGTAGCAT TTGGaCAGTC AATTGTATCT 1020 

TTAATCTTTT CAATTGCCTG AGCTGTGAAT TGaTCAAAGC CATATCCCGA AG CTTGTACA 10 80 
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TCGAATGAAA TCACTACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 1200 

AATTTCAAAA CTAGTTCGAT ATTTAAAATG TATTATGGAT GGnTAAAGTT TGTATCGCAT 1260 

5 TATCGCGAAG TTGnATAAAT ATAT 1284 

(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1072 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TaATCGATTT 60 

20 AG ATGG CAC A ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 120 

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACACCTGA 180 

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 24 0 

25 

GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAG CAT CAGTATATAT 3 00 

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA TTAAAAATGA 3 60 

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 420 

30 

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 480

AATTCCTAAA GAGCGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 54 0 

35 ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA TGGTTAAAGC 6 00 

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC 660 

CGATATTATG TCTGGTATTA ACGTAGGTAT GGATACGATT CATGTACAAA CAGGTGTATC 720 

TACGTTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 780

TGAAGCAATA GCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG GTGAATTGAT 84 0 

TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA ATTAAGTAAA 900 

45 

CTTGGTGACG TTGTTATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA TCAATTTGTG 96 0 

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC GATGCAGAAA 102 0 

TTTTAGCGCA ATCACCAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA TA 1072

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3271 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 
TAAAAACTTA CTTTAACACC ATTCCTTTTT AACTTTTTTC GTGTTTCnTT TnCTTAAGTC 60 

CATCCATATT TTTAATGATG TCATCTGCTG TTTTATCTTT TAAATCTAAC ACTGaGTGAT 120 

AmCGAATTTG TAGCACAGGA TCAAATCCTT TATGGAATCC AGTATGTTCA AATCCTAAGT 180 

TACT CATTTT AT CAAAG AA C CAATCATTAC CAGCATTACC TGTAATCTCG CCATCATGAT 24 0 

TCAAGTATTG ATATGGTAAA TATGGATCGA TATGTAGGTA TAGACAACGA TGTTTTTTAA 3 00 

CATATTTTGA TAATTCATTA AAGAAAAAGT GTACGAGTTC TTGATTTTCA TAATCAATCA 360 

CTGGACCGCG ATTTGAATAA AAATACTTGA ACACTTTCAT AACAGGTACA GCAGTAAGtA 420 

AGCAAGCTGC AATGACCTCG TTATTATTGT TTTTTATTCC CACTAAATGT GTTTCATAAC 4 80 

CTTCAGCAAG CTTTAACTCA TAGTGGCCAA CAGTTTGCGT GaAATGACTG TATGGCATGC 54 0 

TATCTGTAAa GGCACCAAAC TCTTTAGCTG TTAAATTTGT AAACTTCATT ATCATTACTC 600 

CTATTTGTCT C7CGTTAATT AATTTCATTT CCGTATTTGc AGTTTTTCTA TTTCCCCTCT 66 0 

GCAAATGgCA AAAATAATAA AT CT AATCT A AATAAGTATA CAATAGTTAA TGTTAAAACT 72 0 

AAAACATAAA CGCTTTAATT GCGTATACTT TTATAGTAAT ATTTAGATTT TTGAATACAA 78 0 
TTTCAAAAAA AGTAATATGA ACGTTTGGGT TTGCTCATAT TACTTTTTTT GAAATTGTAT 84 0 

TCAATTTTAT AATTCACCGT TTTTCACTTT TTCAAACAGT ATTCGCCTAA TTTTTTTAAA 90 0 

TCAAGTAAAC TTAATTATTC AATGTTTGTT GGATAGATTG TAAATATTTA ATGATTTCCT 960 

CACGCGTGTT AGATTTAAAT CGCTTAACGA TTTCGCTACC AATGACAATG CCATCTGCAA 102 0 

CCTCTTTTAT AT CTGCAACA TGTTGTGGTG TTCTTATACC AAATCCTGCG ACAACTGGCA 10 8 0 

CATTGGCTAT CGCTTTAATT GACTCAATTT TTCGTTTTAA TTCTGGATGA AACGCACCGT 114 0 

TTTGCCCTGT TGTCGCATTC ATCGTCACAG TATAAATAAA GCCTTCCGCA TGGGATACGA 120 0 

45 TATCTTTTAT ACGTTTGTCA TCAGTAGTCA TCGCAACTAA CGATATGATT TTGACGCCAT 126 0 

AGTGACTAAA TTGTTGTTTT AAACGCTGCG ATAATTCATA TGGTAAATCA GGAATAATTA 132 0 

AGCCGTAGAC AC GAG TAT CT CGACATTTTT CAAAAAACGC TTGTTCTCCA TAATGACAAA 13 8 0 

50 TAATATTATA ATACGTCATT AATACATAGT TACACTTAAT TTGATCACCA TGTTTTT CT A 144 0 

ATTGATTGAA AATATAATCT ATCGTGATGC CTTGTTTAAT CGCTTGTTGA CCTGCTTCCA 15 0 0 

TGATAACTGG AC CAT CAG C A AC C GG AT C AG AGAAAGGTAC TCCAATTTCA ATT AT AT CTG 156 0 
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GTATAAATAA TTTAGTCATT TGCAAGACCT CGCTCTACCA TATATTGTCT AATTGTTTCC 1680

ATATCTTTAT CGCCACGTCC AGAAATAGTT ACTACAATAA TATCTTCTTT CGACATCGTA 1740

GGCGCTAGTC TTTCAACATA ACTCAGTGCA TGTGCACTTT CAATTGCAGG TATAATACCT 1800

TCATGTTTTG TAAAGTTGAT TAAAGCATTC ATTGCTTGTG TATCACTAGC ATTTTCAAAA 1860

GTTACTCTAC CAATGTCGTG GTAATAAGAA TGTTCTGGTC CAATACCAGG ATAATCAAGT 1920

CCTGCTGAAA TAGAATGTGC TAGTTGCACT TGCCCATCTT CATCTTGAAT TAAATACATT 1980

TTAGTACCAT GTAATACGCC AGGTGATCCT TTGCCAATTG CAAGTGCATG TTTATCAGTA 2040

TCATCGCCTT GACCTGCGGC TTCAACACCG TATAATGCAA CATCATCTTT AATAAATGGA 2100

TAAAATGTAC CGATTGCATT TGAGCCACCA CCGATACATG CTACAATTGC ATCCGGAAGT 2160

CGACCTTCTT TCTTCAATAT CTGTGATTTT ATTTCTTTAC CAATCACACT CTGAAAATCT 2220

CTAACAATCG TTGGGAACGG GTCTGGACCT AATGCAGAAC CTAATAAATA ATGTGTATCA 2280

TCTACATGAC TTACCCAATA TTGCAATGCT TTATTAACTG CATCCGATAA AGTCCCTTGA 2340

CCATCTTCAA CTGCCACAAC CTTTGCACCA AGTAATTCCA TTCTAAATAC ATTAAGTTGT 2400

TGTCTTTTAA TATCTTCACT TCCCATAAAG ACAACAAGTT CCATATCAAA TAATGCAGCA 2460

ACCGTAGCAC TAGCTACACC ATGTTGACCC GCACCAGTTT CAGCAACAAG CTTCTTCTTG 2520

CCCATTCTTT TAGCAAGCAA CGCTTGACCT AACGCATTAT TAATTTTATG GGCGCCTGTA 2580

TGATTTAGAT CCTCTCGTTT CAAATATATT TTAGCGCCAC CTAGGCTTTC AGTATATGAT 2640

GCAGCATATG TAAGTGGTGT CGCGCGTCCT ACATACTCTG ATAAATAGTA TTCCAGTTCT 2700

CTTTGAAACT CTGGGTCTGC TTTTGCCTCT TTATAAGCTT TTTTCAACTC AATAATTGCT 2760

GGCAITAATG TTTCTGGAAC ATATTGCCCT CCATATTCAC CAAAGAAACC TAATTCATCT 2820

GCTTCTGTTT GTATTTGTTT ATTCATTGTC TCTATCTCCT TTCACAATAT TTACAATTGC 2880

TGTCATTTTT TCTATATCTT TTCGCCCATT TACTTCTATA CCTGATGCAA GATCATAACC 2940

TTGATGTGAT AATTTAAGTT GATTAACTGT TTGAATATTT TCAGAGTTAA TGCCTCCTGC 3000

TATCAAATAA GGTATGTCTT TTATGTGCTT CAAAATAGTC CAGTCATATG TTTGACCGGT 3060

ACCACCATAC GACACTGAGG GTGTGTCGAT AATAAATAAA TCTACGAACC CTTTATATTT 3120

ATTTATGTTT TGGATTATGT TTTCATCTnG CAGtAAAGCT TTAGTGATTT TAATGCTTGA 3180

ATATkTCTTT TTAATTTCCT GTAtAAAAtC AATAGATTCt GtGTAACTGT ATTGTGkTAA 3240

wTGaCGwATg CtTAAwACgT GTGCCAATGG T 3271

(2) INFORMATION FOR SEQ ID NO: 437:

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(A) LENGTH: 1553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60

AAAATGAAAA GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120 

TAATTGAGCT ATGCTTATTA TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 18 0 

15 ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAGCCCAA AATACTTGTT 24 0 

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATAAAC TTGAATCTGA AAGTAATTAC 300 

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 36 0 

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420 

GAGTt CACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCT TATTAAC CGT AATAACGATT 4 80 

ACCTTGAAGA CGT ATTTTT C TTATTATGTT GATTTTTCTT TAGGTGTTAA AGGTTTAGTA 54 0 

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 6 00 

CTATTCTTTA AAGGCAAAAA AGCATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 66 0 

TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 720 

ACTTTAAACC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 780

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT T ATT CATTTT AATATTTAAA 84 0 

ACAAAATGGT TAGACACAAA AGCATTTAGT AAGAAATTTG TTCCTGTCGT AATGGCGGCT 900 

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 96 0 

ACACGTACAT TTGAC CATAA ATATTTAGTG AAATATTTAG GACCTTATAA CTTTACAGTA 102 0 

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 108 0 

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACmG AGCCTAACCC rGAwTATTAT 114 0 

GGGGTGGcAA rGAAGAAAAA TATTATTArG ATT CATTT AG AAAGTTTCCA AACCTTCTTA 12 0 0 

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 12 6 0 

AAAGAGCAAT TCACATACTT CCCTAACTTT TtCCATCAAA CAGGTCAAGG TAAAACATCT 132 0 

50 GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 13 80 

TLaaaaGGAG ATAATACGTA TCAGTCATTA CCAGCAATTT TAG AT CAAAA GCAAGGCTAC 1440 

AAATCTGATG TCATGCACGG TGACTATAAA ACATTCTGGA A C AG AG A C C A AGTATATAAA 150 0 
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(2) INFORMATION FOR SEQ ID NO: 438:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1419 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 

ACAAAATCAA TCAGCAAAAT GATTGGATTA AGACGCAAAT TGAGCGTTCA ATGGAAGGCG 6 0 

15 AAACAGTTGG CATTAATGAT CAAAATATAG AAATATATAG TGAAcGtTgA GATTTATACC 12 0 

ATACACTCGT ACCTTTAAAT CAAGAATTGC ATAAGTTGCG ACTTAAAACT CAAAACTTAA 180 

CCAATGAAAA TTATAATATT AATGATGTGA AAGTTAAAAA GATTATTGAA GATGAACGTC 24 0 

20 AAAGACTAGC ACGAGAACTT CACGATTCTG TTAGTCAGCA ACTTTTTGCG GCmAGTATGA 3 00 

TGCTATCTGC TATCAAAGAA ACGAAtTAGA ACCACCATTA GACCAACAAA TTCCTATTTT 360 

AGAGAAAATG GTTCAAGATT CGCAGTTAGA AATGCGTGCT TTGCTGTTAC ATTTAAGACC 4 20 

25 GCTTGGTTTA AAAGACAAAT CTTTAGGTGA GGGTATTAAA GATTTAGTTA TTGATTTACA 4 80 

AAAAAAAGTG CCAATGAAAG TTGTGCATGA AATACAAGAT TTTAAAGTGC CTAAAGGTAT 54 0 

TGAAGATCAT TTGTTCAGAA TTACACAGGA AGCAATTTCG AATACATTGC GTCATTCAAA 600 

30 

CGGTACAAAA GTGACAGTAG AATTGTTTAA TAAAGACGAT TATTTATTGT TGAGAATTCA 660 

AGATAATGGT AAAGGTTTTA ATGTTGATGA AAAATTAGAA CAAAGTTATG GACTTAAAAA 72 0 

TATGCGTGAA AGAGCTTTGG AAATTGGTGC AACGTTCCAT ATTGTATCAT TGCCAGATTC 780

AGGTACACGT ATCGAGGTGA AAGCACCTTT AAATAAGGAG GATTCGTATG ACGATTAAAG 84 0 

TATTGTTTGT GGATGATCAT GAAATGGTAC GTATAGGAAT TTCAAGTTAT CTATCAACGC 900 

40 AAAGTGATAT TGAAGTAGTT GGTGAAGGCG CTTCTGGTAA AGAAGCAATT GCCAAAGCCC 96 0 

ATGAGTTGAA GCCAGATTTA ATTTTAATGG ATTTACTTAT GGATGACATG GATGGTGTAG 1020 

AAGCGACGAC TCAGATTAAA AAAGATTTAC CGCAAATTAA AGTATTAATG TTAACTAGTT 10 80 

45 TTATTGAAGA TAAAGAGGTA TATCGTGCAT TAGATGCAGG TGTCGATAGT TACATTTTAA 114 0 

AAACAACAAG TG CAAAAG AT ATCGCCGATG CAGTTCGTAA AacTTCTAGA GGAGAATCTG 12 00 

TTTTTGAACC GGAAGTTTTA GTGAAAATGC GTAACCGTAT GAAAAAGCGC GCAGAGTTAT 1260 

ATGAAATGCT TACAGAACGA GAAATGGAAA TATTATTATT GATTGCGAAA GGTTACTCAA 13 20 

ATCAAGAAAT TGCTAGTGCA TCGCATATTA CTATTAAAAC GGTTAAGACA CATGTGAGTA 13 80 
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(2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439:

ATGTnGGCGG ACGTAAAGGA CATSTTTATA CTGATGATCG AGCATTAGAT ATTGATATCG 6 0 

TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA ACAATTATTT GCAGCAGGTT 12 0 

ATGCATCTTG CTTCAACGGT GCTTTCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 18 0 

CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAGACTCA GAAAGTCCTA 24 0 

20 AATTAAGTGT TTCAATTGAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCTGAAA 300 

AATATTTACA AATGGCTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA GGAAATATTA 36 0 

ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 42 0 

25 ATTAATAAAA TTCACATAAA ATTCAAATTt StCrAcCAAA AATTTTTGGT TGGyTATTTT 4 80 

TTCTATTCGT GATTGAAATT TCTGGCAATA TTTAACTGAA AATGATTGTA C CTT AGT CAT 54 0 

CATAAATGTG ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 60 0 

30 

GTCCAGTG 6 08 

(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 
CCTTTCAAtT TCTCCCAAgA TTTACGCATG TCTGACCAAA TGTCAATACA AT AC C TGAAT 6 0 

45 AATGATTTCT GGAGTAATTG CGAAAAATAT TTAAAAGTTG CTATAGATCA ATTTTCAAAT 12 0 

TATAGTATCT CATCTCAAGT TTCTAACTAT CATTTCACAG TATTACTTGG AGATCGCCAA 180 
AAACCACTTA TGTATCTAAA TAAAAATCGC GGTGGTGATG GTGGCATACC AGGTTATATT 24 0 

50 

ATGATTTATT TAGTGCCGAG TACAAGTACA ATTAATTCTA TGAAAAGCTT AATTGCACAT 300 
GAAGTAAATC ATAATATGCG CTATCAATAT ATTGATTGGG ATGGCGGAAG TTTGATTGAA 360 
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ATTGGACCGT GGGTAACAAA TACTAATTGA AGTCGTGATA ACGTAAAAAT TAAAAATACT 480

ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 54 0 

GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 600 

TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 66 0 

ACAACACTTC CAGCACAAAA AA 6 82 



10 

(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2574 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:

ATCTTCTTTT AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 6 0 

ACGTTCAATT AATTCTTCAA CCATTGTCTT TTGCGATATA GTTGGTTCTG ATTCAGACCA 12 0 

25 

AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 180

TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAGATTTCAC 24 0 

CTGTAATTTA GGCATTTGAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 3 00 

30 

CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360 

GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ATTAAACGCT TTGCATCTTT 4 20 

GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 480

35 

AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 54 0 

ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TTGTTACCTG 600 

40 TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660 

ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GG AT CATTAA AAAATAAATG 720 

TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATGTTTGACA 780 

45 AACGCCTTTA ACAATCCAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 84 0 

TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 9 00 

TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 960 

50 

GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCATAACT 102 0 

TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 1080
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10 



20 



AAATTCTTTA TAGACTGATG CAAATCTAAC ATATGAAACT TGATCAACAT GCATTAACAA 1200 

GTTCATAACG TGTTCACCTA TATCTCGTGA AGACACTTCC GTATGACCTT CATCTCGTAA 1260 

TTGCCATTCA ACCTTGTTAG TTATGTCTTC AAGTTGTTGA TATCTAACTG GTCGTTTCTC 132 0 

ACAAGAACGC ACAAGTCCAT TAAGTATCTT TTCTCTTGAA AACTGCTCTC TTGTGCCATC 13 80 

TTTTTTCACA ACTATAAGCT GACTAACTTC GATATGTTCA AATGTAGTGA AACGTGTTCC 14 4 0 

ACAATTTTCA CATTCTCTTC GTCTTCGAAT GGCATTTAAT TCATCGGCAT GCCTTGAATC 150 0 

TACAACTTTA GATTGTGTAG AATTACATTT CGGGCATTTC ATTACATCAC CCTCTTTATT 1560 

TTGATTATGC CTAATTATAC TATAAATCTA GAGATGAAAA AAGAATCCCT CAATTTAATT 1620 

CATTTAACCA AATAATGAAA CAATAAAAAA CATTATATCG TTACTTATTA AGTAATTTGC 1680 

ATGACAATAT TATTGTATTA AAAATAAAAA ACCTAACTCC GAAGTCAGAG TTAGGCTATA 174 0 

AATTAATTGT ATTAACTTGC ACTTACAGTT TCTTTTGATG TCAAAAGTGC TCCAATTTG C 1800 

TCAGCAACAT CTACAACTCT ATTTGAATAA CCCCATTCAT TATCATACCA AGCAATAACT 1860 

TTTACTTTAT TCCCTGACAT GACCATTGTT GATTTTGCAT CAATAATAGC TGAATTTGGA 192 0 

25 TTAGTATTAA AATCAACAGA CACTAGTGGT TGATGTTCGA CTTCTATGAT ACCTTCTAAA 19 8 0 

CCTGCATTTT CAAAAGCTTG GTTTACTTCT TCTGCAGTTA CTTCTTTTTC TAAATCAACA 2 04 0 

ACTAAATCAA CGAGCGATAC ATTCTTTGTT GGTACACGTA ATGCCATGCC GTGTAATTTA 2100 

30 

CCTTCTAATT CTGGTAATAC TTCTTTTAAA GCTTTCGCCG CACCAGTAGA AGTAGGAATA 2160 

ATGCTTTCAT TACATGAACG TGCACGTCTT AAATCTTTAT GTGGATTATC AATATTTTTT 2 220 

TGGTCATTTG TAATAGCGTG AACAGTAGTC A IT AAA C CAT TAACTATTCC AAACTGATTA 22 80 

35 

TTTAAAACTT TTGCAACTGG ACCAATGCAA TTAGTAGTAC ATGAAGCATT ACTAAAAATG 2 34 0 

TCAAATGCTT CTATATCTAA TTGGTTATCA TTTACGCCTT TAACTACCAT TTGAACATGT 24 0 0 

CCACCTTTTG nAGGACCAGT TAACAAAAsT TTTtTGGCAC CTGCTTTAAT ATGTGCGATG 24 6 0 

40 

GCTTTATCAC CATGaTTAAA TTTACcAGTT GCATCTATAG CAATATCGAT ATCTAATTCT 2520 

TTCCATGGCA AGTTTTCAGG ATTGCGATCA GCAACCAATT TAATTTTATG ATCT 2 574 
(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
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CAAGGTACCG GTTTAGGTTT GTTCATTTGT AAAATGATTA TCGAAGAGCA TGGTGGTTCC 6 0 

ATAGATGTTA AAAGCGAATT AGGGAAAGGC ACAACATTTA TTATTAAACT ACCAAAACCA 12 0 

5 GAATAAAATT GAATATAGTT ATTTCAGAAC GCATGTTATT GATTAGAGAC TCTAATTTAT 180 

AGCATGCGTT TTTTGATTGA TGTGGGGAAT TTTGTATGTG GATTAGAACT TAGGGTTTTT 24 0 

G CG AATATCA ACTATTAAAT ATATTACTAA TTTATACAAA AATATAAAGT TTGATAAAGT 300 

10 

TATTTATTTG ATTATAAAAA TAGGGTAAAA TATAGATATA TTGTATTAAT TAAATTATTC 360 

GAGGTGTCAT ATGAAAAAAT TCATTGGATC AGTTTT AG CT ACGACATTAA TTTTAGGGGG 42 0 

ATGTTCCACG ATGGAAAATG AATCAAAAAA AGACACAAAA ACAGAAACAA AATCTGTACC 4 80 

AGAAGAAATG GAAGCTTCAA AATATGTAGG CCAAGGCTTC CAACCGCCTG CAGAAAAAAA 54 0 

TGCGATTGAA TTTGCGAAGA AGCATCGTAA AGAATTTGAA AAAGTAGGTG AACAATTCTT 6 00 

20 TAAAGATAAC TTTGGACTAA AAGTTAAAGC TACAAATGTT GTAGGTAAAG ATGATGGTGT 6 60 

AGAAGTTTAT GTGCATTGTG AAGATCATGG CATTGTATTT AATGCAAGTC TACCTTTGTA 720 

CAAAGATGCC ATCCATCAAA AAGGATCAAT GCGCAGTAAT GACAATGGTg ATGATATGAG 780 

25 

TATGATGGTG GGTACAGTGC TGAGTGGCTT TGAATATCGA GCGCAAAAAG AAAAGTATGA 840

TAACTTATAT AAATTCTTCA AAGAAAATGA AAAGAAATAT CAATATACAG GCTTTACAAA 900 

AGAGGCAATT AACAAGACAC AAAATGTCGG ATATAAAAAT GAATATTTTT ATATTACATA 96 0 

30 

CTCTTCTAGA AGTTTAAAAG AATAT CGAAA GTATTATGAA CCACTGATTC GAAAAAATGA 102 0 

TAAAGAATTT AAAGAAGGAA TGGAACGAGC AAGAAAAGAA GTGAATTACG CTGCAAATAC 10 80 

35 AGATGCTGTT GCTACACTTT TTTCTACTAA GAAAAACTTT ACTAAAGACA ATACAGTAGA 114 0 

TGATGTAATC GAACTAAGTG ATAAATTATA TAATTTAAAA AATAAACCAG ATAAATCTAC 120 0 

AATCACAATA CAAATAGGGA AACCCACTAT TAATACTAAG AAAGCCTTTT ATGATGATAA 1260 

40 TCGTCCAATA GAATATGGGG TGCACAGTAA AGATGAATAA AATTAATGAT AGGGATTTAA 13 20 

CAGAATTAAG TAGTTACTGG GTTTATCAAA ATATTGATAT AAAAAAAGAA TTTAAAGTTA 13 8 0 

ATGGAAAAAG GTTTAAACAA GTAGACAGTT ATAATGATGA TAAGAATAGT AATTTGAATG 144 0 

45 

GTGCTGCTGA TATTAAAATA TATGAGTTAT TAGATGATAA AAGTAAACCA ACTGGTCAAC 1500 

AGACAATAAT TTATCAAGGA ACATCTAATG AGGCAATTAA TCCAAATAAT CCATTAAAAT 156 0 

50 CATCGGGGTT TGGAGATGAT TGGCTCCAAA ATGCTAAATT AATGAATAAT GATAATGAAA 162 0 

GCACAGATTA TTTAAAGCAA ACAGATCAAT TATCAAATCA ATATAAAATA AAGTTAGAAG 1680 

ATGCAGATAG ATTATCAAAT AGTGATTTTT TAAAAAAATA TAGAATGGAA TCAAGTAACT 174 0 
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ATCAAGGAGC GAAACATCCG AATGAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT 1860 

ATGCTGCTTG GCAGAAATTT GCTAGACCAC GCTTTGaTAA TATGATTAGT TTTAATAGTA 1920 

5 CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 1980 

TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGTTATA 2 04 0 

AAAGGAAGTT AAATAGAAAA GATAACACAT ACGATACTGT ACCACTAATC AAAATAAAGT 2100 

CGGTAAAAGA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 216 0 

TAGATATGGA TGGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GGACGTTCTG 222 0 

GAAGAGGAAC TTTAATTAAA CTTAATTTAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 228 0 

15 

CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 234 0 

TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 24 00 

2 0 AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATT AGTAAAATAA 24 60 

ACTCACTTGA AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTGCTGTT AAACACTTAT 2 520 

TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTTT AAAGGTATAA 2 5 80 

25 AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 264 0 

ATATAGATCA CGACTTCAGA GATGGAGTAT CTGAAGAAAT GATGAAACAT TTGAAAGTAG 2 700 

TGAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 276 0 

30 

ATATTAGAAG TATTATGAAA CAACAAGATG CAACAATTTT AGATGGAAAT TTTCAAATTA 2 82 0 

ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 28 8 0 

AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 2 94 0 

AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 3 00 0 

CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAATTGA AATGTTAAAT GTAAAAAAGA 3 06 0 

40 AAGAATTTTT GTCCCTGATT CCTAATGTAA CTTGTAATTA TATTAAAACT AAATTAGAAG 312 0 

AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCTTAA TGATTTAAAA GCAGTGTCAC 318 0 

CAATTTTAGA TAACCATTTA GATGATATTG TTAAGAACAT GAAGCCTTTG ATTGTACAAA 324 0 

45 TGAyATwTGA ACCATCACAT TATGaCGATA TGTTTAATTC aAGAAAAGCT TTAACGcCAG 3 3 00 

TGTTCTCAAG CGTTTTATAA AGGTTG 3 326 
(2) INFORMATION FOR SEQ ID NO: 443: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

5 ACCTATAGCT GGCCCGTCAA GTCCTGAATC ATATGTCGCT ATAAAC CCAG TTGCATGCCC 6 0 

AGCAATCTCG GTTTCAATTT CAAAATCATG CTCTTTCAAA CGATCTATTA AAGTTCGAGA 120 

CGCAAATATT TCTTCATTAC CAAGTTCAGG ACGTTCATGA ATTCTATGAC TGATTTCGAT 180 

10 ATAACTATAT TTATTTGTTT CTATATAATC GAGAATTTGT TGTTTTTCAC TCATTTTTTA 24 0 

CTATCTCCCT TTTACCCTAC ACTCATTTTA ATCATCCACA ATATTTTGTT CTTTCAAAAT 3 00 

GAATTATTAC TTATTCTATC GGTTTTATCT CATGATGTCA TCTAGTTTTT CTTTATTTAC 360 

AAAATTTTCT AATAACTAAA aGCCTTTCAT AAATTTATAA AACAGTTTCA AATTGTAAAA 420 

CATAAGCCCT ATTGTTACAA TTTAAGTATT GTTAGAAACA CAACTTACAA CAAACTTAAG 4 80 

20 TTTTATAATG CAACAAATCA TAAGCGTTTT ATAGTTTTGA GTAGTTAATA GGAGGAAATT 54 0 

CAAAATGACA AAAATGAATG TTGAAAGTTT cAATTTAGAT CaTAcTAAGG TGGTTGCCCC 600 

ATTTATTAGA TTAGCGGGAA CGATGGAAGG ATTAAACGGA GATGTCATTC ACAAATACGA 660 

25 CATTCGTTTC AAACAACCAA ACAAAGAACA TATGGATATG CCCGGACTAC ATT C ATT AG A 720 

ACATTTAATG GCTGAAAATA TTAGAAATCA TAGTGACAAA GTTGTTGATT TAAGTCCTAT 7 80 

GGGTTGCCAA ACTGGTTTCT ATGTATCATT TATTAATCAT GATAATTATG ATGATGTATT 84 0 

30 

AAATATTGTT GAAGCAACTT TAAATGATGT GCTAAATGCT ACTGAAGTGC CTGCTTGTAA 900 

TGAAGTACAA TGTGG CTGGG CAGCAAGTCA TTCATTAGAA GGTGCTAAAA CTATCGCTCA 960 

AGCATTTCTA GACAAACGAA ACGAATGGCA TGATGTTTTC GGTACAGGAA AATAAAT CTT 102 0 

35 

AGTCAATCAA GTTAATCAGA AAaGCAGTCG AACAATGATT TTACAATCGC CATTGTCCAA 1080 

CTGCTTTTTA TTATGCTTCA AAGTCAAAAA ATCGAACAAA TGAAAAAGTA AAATCTTTAA 114 0 

40 CATTTGTCCG ATTTATTTGA GAACCACTAT AATTTCTTAA TTAGTCCCAT TAACACGAAC 1200 

TGCATAGGTA ACCTTAAATA TAGTTGCCAT GTTGGCAATT GTTTATCTCC TAAAGGTAAC 1260 

TTTTTAACTG CCATATAGAT ATTAGCTGGG AATACAGCTA GCAAGAATAG ATTGATTGTA 13 20 

45 TTTTTCAAGC ATTGAGATGG TCTTTTAATT AAAAGTGCAA GTCCAAATAA TATCTCAAAG 13 80 

ATTCCTGTAA CAAGAACCGC TGTTTTTCGA AGTGGCAAAC ATTTCGGTAT GATATTTCTA 144 0 

AATTGTCGTT CTCGTGTAAA ATGCAATACA CCTATTACAC TAAAACCTAT TCCTAATAAA 1500 

TATCTTAGTA TGTTCAATCA GCTTCAACTC CTATTCTGTA ATGATTTTAT GAATTAATGT 156 0 

AGG CGATACA ACATGaTCAG CAATTGTTAT GCTTGAATCT AATTTTTTAA CAACATCGTC 162 0 
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AATTTTTTTA TTTAAAACAA TACCAACCGC TAAAT CAATA TCATCCTCTT TTGTTAAACG 174 0 

TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGTTA GAGACTAATT CAGTCACATA 1800 

5 ACCTGATTTT TTAGCTTTAT ATTCAATTTG ATATTGAGCT TGTGGCAAAC GCTCTGGATG 1860 

GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTTTTGAATT TTTCTAATGC 192 0 

TGCACCTGAA TTAATTGCCT CAATTAGCAA CGCTCTCGCT TCTTCAAGCG TTTCAGCTTT 198 0 

10 

GTTTGCAAGT ACAACCATTT GAGAACCTAA TGTTAATACA AGTTCTGTTA AATCTTTCGG 2 04 0 

ACCTTGTCCT TTCAACGTAT CAATTGCTTC TTGTAACTCA AGCGCATTGC CAATCGCACG 2100 

TCCAAGTGGC TGATTCATAT CAGAAATAAT CGCCATCGTA TTACGTCCCA CATTATTACC 2160 

15 

AATACGTACC ATTGCGTGCG CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC 2 22 0 

GCTACCAGTT TTTACATCTA ATACAATTGC ATCTGCACCA GCAGCAATCT TTTTACTCAT 22 8 0 

20 AATTGAAGAG GCAATTAATG GTATTGAATT GACAGTACCA GTAACAT CCC TTAAGGCATA 2 34 0 

TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGTCCT ACAACTGCCA CTTTATTTTC 24 00 

ATTAACCAAT TTCACAAATG TTGCTTCATC TATTTCAACA TGAAAACCAT CAATTGCTTC 24 6 0 

25 TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG 2 52 0 

AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC 2580 

TGTTGAGTGC TTATCTACTT TGACACCTTT AATATCACTC AAATCTATCA TATCACCAGA 264 0 

30 

ATT AAC CATA GCCATCGTTA ATGCTGCACG CTCATCATCA TTCATATCTT GGAAATAAAT 270 0 

CGCCATTGCT AAACTTGATG CTTGGTAATC AGGAATATCC CCTTTAACAT AGCCGCCAAT 2 760 

AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTTTTTCT CAATAATGTC 2820 

35 

TATCATTCTC ATTTTTATCA TCCTTTTCTT AAAAAGCTTA GGACAAAGCA TCTGCGCTTT 2 8 80 

CTCTAGTCCA TTTTTAAAAG CACAAGCGAA AATTATTATA GCAAGCTATC GATTTCATTT 2 94 0 

40 TTAATATCAC AATTTCAATG CGATGTTATT ATTCTTAAAT AGATTGGTTA TAACGTTAAA 3 000 

GTCCCTATTA AATTATCTTA GAATCATCAT GGCATTTATG ATGTCTTAAA GCTGATATCG 3 060 

ACATACTTAT ATATGGTTAC GATGTCCCAT GCTTACATAT TTTTATAAAA TTAGTAATCT 3120 

45 

GAATCTGCTT CTAAACCTTG CATAATTTGa ACGCCTGCGC TCGCACCAAT ACGTGTCGCA 3180 

CCTGCTTCAA C CATTTT ATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT 3 24 0 

TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTGCAGT CGCACCGCCA 3300 

50 

CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTTTTGT TAATTCACTC 3 3 60 

GCTTTTACAA TTTCGTCATG GTCCAACAAT ACCGTCTCAA TAATCACTTT TACTGTGTGA 34 2 0 
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TTTAATGCGC CGATGTTGAT GACCATGTCA 
TCTGTTTCAA ATGCTTTCGT CGCAGTTGTC 

5 CAAACTAACA CCTCTGAATC AGCTAGTCGC 

ACACATACAG ATTTAAAATG GTATGCTTTC 
CGTGTTGACT CAGGCTTCAA TAAAGTGTGA 

W CTCCTCCTTT ATGGTTGATT ATAAAAATAC 
TCATTTAAAT ATCAGGTTAG ATGTTCGCTT 
TTAATTCATT TCCCATAAAC AAACAATACA 

15 

GACTTGACAA CAAGCTAATC AGGATTATAT 
AATTCGCAAA AAAGGGAAAC AAATGTTATC 

20 ATAACAAATG ACAAAAGGTA CA CCA CAT AT 
AG TATTAATG CCTGGCGATC CGCTACGTGC 
TGTTGAACAA TTTAACGATG TACGTAACAT 

25 AGAAGTTTCT GTAATGGGTT CTGGTATGGG 
GTTATACAAC TTCTTTGATG TAG AT A CAAT 
AGAAAATGTT AACTTATACG ATGTTATTAT 

30 

TGTAGATCAA TACAATATTC CAGGTCATTT 
TAAAGCTAAA AATGTCGCTG ACCAAATCGG 
TTCTGATACA TTTTACAATG CCGATCCAAC 

35 

TTTAGGTATC GAAATGGAAT CAGCTGGTTT 
AGCACTTGGT ATTTTCACAG TAAGTGATCA 

40 AGAACGTCAA AATTCATTTA CACAAATGAT 
TATTTAAATT GACTTTAATT GCTCTTTAAC 
TCTGGGCGTA TCCCCATTTA TGTGTTGGCT 

45 AGCTTAAGTA AGATTTAAAC ATCCAATTAT 
ACATATCTTC TGATACATCA TCGCCAGCAT 
GTTTTGAGAT ATTTGTGAAG TCTTGTGATA 

50 

CTTTCGCAGC TTCTTTAACA ATTGATTGCT 
TTCCTCCTAC CGCTAAAATT CTTTCAGCTA 

55 
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ATTTCATCTG CACCATTTTG AATCGCATCT 3 54 0 

GATGCACCTA ATGGGAATCC TATTACCGTA 3 600 

TCTGCTGCAT ATTTAACATG CGTTGGATTC 3 66 0 

GCTTCATCGA TGATTTGATC GATTTGCGTA 3 72 0 

TCAATCAATT TTGCACTATT CATTTTCTAT 3 78 0 

GGTTGTAAAT TAGTTGATTG AGCGTCAGGT 3 84 0 

TTTATGTAAC CGCATACATA TACTATTACA 3 90 0 

ATTGAACGTG ATATCTTCAT TATGAACGAT 3 96 0 

TTTTATAATT CTTTAATTCT ATAGTACAAA 4 02 0 

TTAAAATTAT TAATGAATAT TAAGGAGAAG 4 0 80 

TCAACCAAAT GGAGTAAAAA TTGCTAAAAC 414 0 

AAAATATATT GCTGATAATT TTTTAGAAAA 4 200 

GTTTGGTTAC ACTGGTACAT ATAAAGGTAA 4260 

TATTCCAAGT ATTGGTATTT ACTCATATGA 4 32 0 

CATTCGTATC GGTTCTTGTG G CG C ATT A CA 4 3 80 

TGCACAAGCT GCATCAACTA ATTCAAATTA 444 0 

CGCGCCTATC GCTGACTTCG AG TTAGT AAC 4 50 0 

TGCTACTACA CACGTAGGTA ACGTATTATC 4 56 0 

ATTCAATGAT GCTTGGAAAA AAATGGGTAT 4 62 0 

ATATTTAAAT GCGATTCATG CTGGTAAAAA 4 6 80 

TATTTTACGT GACGAAGCTA CTACACCTGA 4 74 0 

GGAAATCGCT TTAGAAATCG CAGAGTAACT 4 800 

AATGCGATTA AACTCAAAAA GCCAACACAT 4 8 60 

TTTATTTATA TTATTACTTA TCTGTAGATT 4 92 0 

GTTTATCTAC TGATGTTTGC ATACCTATAA 4 980 

TACCAGCAAT TTCGATTGCG TTTTCTAATT 504 0 

ATTCTTCAAC CATTTGTTCT GCAGAGTAAC 510 0 

CTAAGCATTC AGTTAATGTA CCTACAGGGT 5160 

ATTCGTCTAC ATATTGGCTT GCTTCATTAT 52 20 
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AATTGTGTAG CTTTGTGTAA G 53 01 
(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

GAAGATGAAA GCTATATTGA AGATGATGTT ACTAAAAAGG AAGCTATTTT AAGTATGCAA 60

ATTCCTAAAG GTTTCTCTCA AAAATTAAAA GAGAACCGTT TAAAAGAAAC GATACAGTTA 120 

TATGGTAGAG ATGACTTTAT AGGTGGTATT GCTGTAGAAA TTGTTAGTAG TTCATTATAT 180 

20 GAGCAGCAAA TTCCTAACAT TATTTATGAA CACCTTGAGG ATATGAAACA GCATCAATCC 24 0 

ATCGATGCTA TCAACAAGTC CTATCATAAA CATACACCTG AATCTAAAAT CAAATTTGTG 3 00 

TCGCTTACTA AACAAGCACA ACACTCTATT TCAATTAGCT TAATCTTTGC GGTGATTTTA 360 

25 TTTGTTAGCG CTGTTCAAGT AGTACTTCAT TATCGTTTAA ACCAACAAGC AG CATTG CAA 42 0 

CG ATT AT CAC AAT AT CATTT AAGCCGTTTC AAACTATATA GTACTTATGT AATGACACAT 4 80 

ACGATTTTGT TATTGTTGGT ACTATTGGCA GTTAGTCTAT ATTTGTCTCA ACCACTCAGC 54 0 

30 

TTAATATTTT ACTTAAAATC ACTGTTACTT ATATTGATTT ATGAGATAGG TATCGTCTTT 6 00 

ATCTTATTCC ATATTCAAAC AATAAGTCAT CGATTATTCA TGACATTTAT ATATGCACTT 66 0 

35 GCTATGGGTA TCGTATACTT GATTATTTTC ATGTAAAGGA GCGTAACTGA TGATAGAAAT 72 0 

TAATAACCTT TCAAAGCGTT ACCGTAACAA ACAGATTTTC AATCATTTAA CTATGTCCTT 780 

TGATAGTAAT CGTTTAACCG TATTACTTGG TGATAATGGT GCTGGAAAAT CAACATTACT 84 0 

40 TCGTATGATT GCTGGTATTG AAAAAGCTAA TGATGGAACT ATCAACTATT TCGGCGAAAA 900 

ATGGAATCAA AGACaAATAC aAAATCACAT CGGTTATGTG CCACAAGACA TTGCGTTATT 960 

TGAACACATG ACAGTGGCTG AAAACATTAA ATTTTTTAAA TCACTTTGTA AAAATCCAAT 102 0 

45 

TAACGATACA ACTATCAACG AATATTTACA GCAATTAAAC TTTGATGATA CGTCTGCCAA 1080 

AGTATCTACA TTGTCCGGTG GGAATAAACG TAAAATTAAT ATATTAGTAG GTTTACTAGG 114 0 

TCAACCTCGA ATTCTCATTT TAGATGAACC GACAGTTGGT ATTGATTTAA AATCTAGACA 12 00 

50 

TGACATCCAC CAACTACTTA ACATCATGAA ATCTAAATGT TTAATTATAT TAACTACCCA 126 0 

TCATTTAGAT GAAGTTGAAG CACTTGCAGA TGATATCAAG TTAATTGGCC AAGATCCTTT 132 0 

55 
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w 



CCCAAGCTGC GTATGATATC GCAACTTGGG ATTTTCTGTA TTATCTACTT TGCAAGTATG 144 0 

ACGTTGGGTC TACTGCATAT TGATTACCGA TGCCACCAGA CATACGTTGG AAGTGTACGT 150 0 

GAGGCGCTGT TGAATTACCC GTACTACCTG AATATGCAAT TTGGTCACCA GCTTTGACTT 156 0 

TATCACCAGC TGAAACAGTT AAACGATTAT TATGCATATA CCATTGGTAG TTATTACTGT 1620 

TCGCTTCTTT AATCGTTACT TGATTGCCGC CACCATAGTT ACTCCAACCT GCTTGTACTA 16 80 

CTGTACCATC AGTTAATGAG TAAACTGGTG AATTTTCAGG CATTGCATAG TCGACACCGT 174 0 

AATGCGCACC ACCACCGTGA TATTGTC CAT ATGGTTGTAG TTGTTTACGA CTTGTTAACC 1800 

AGCTTGCGTC TTTCGCATGA CCACTAGcTG TCGCTTTACT TGCTGATCCA CCATTTTGAT 1860

TAGATGTGCC ATTAGGATAA TTGACCTTTC CATTACCATC ATGGCTGTTA TACGCTTGGT 192 0 

TGTTGTTACT ATGTGAATAA TAGCTCGCGT CTGGACCTAC ATTTGATTGA TAACCATATT 198 0 

20 GATTAATATG CTGTTGGCTT TGACTCGCTG TGTAGTCATT GTTATCTCCT GCTGTTGCTG 2 04 0 

GATTCACATA TGTTTGGCCG CTTCCATTGG CATTTGCATT TTTTGGATAA CAGTTATAAA 2100 

AATAATGCGT ATGTCCTTGA GCATCTACGA ATGTATAGCT ATATTCTTTA TTATCAAACA 216 0 

25 TTGCTTGATT CCAGTTACCA TCAGGTGTGT GATGATAATC CCCATTAGAA TCAATTGTAT 222 0 

AATAAGTACC ATAAGATACG TCTTGTGATT GTGTTGACAT TTGTGTATGT GCTTGTTGGG 228 0 

TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT GCGCCATTGT AAATGTAGCG AAsCCATCGT 2 34 0 

30 

TGCAATCGCT GCTGCTGTTA ATTTTTTCAT GTATAAAACA TCCTCCATTA AAGTTAAAGT 24 0 0 

TAGTTTTCAA TTAAACTGTA CTGCACATAC TAAAAGAATT AGACAACTGA GTAAAGGATT 24 6 0 

35 TAATTCTCAT TTTCCAACTA TTTAATATTC CCGAAATGTT TTACTAAACT CATTACATTG 2 520 

TCATTACAAA ATAGCCATAC ATTGATATTA AAATGACATC TCTCACTGCA TTCGTTTAAC 2 580 

CTTTTATAAA TTTTCAAAAT TAACAACTAA TCGTTCGTCC ATGTTCGCGA TTCAACGCTA 2 64 0 

40 ATGCATGATA GTAATCATCC ATCAAATCAT ATCAACCAAA TTCCATTATC AATCGCTATT 27 0 0 

GAT7GTCATT CAACTTTCTA ATAGTGATAT GCTTCTCAGG CTTAAAAATC GTCATATCTT 2 76 0 

TTCTATTAAT TAAATCATCT GTGAGCTTTA ATGCTACTAA TTCATTGCTG CCATAATACT 2 82 0 

45 

TAATATATAA CGTTCTTGTA GTTAAATTTA TTACGGTCTG ATACATCGTA TAGTGATTGT 28 80 

CAGCATCATG CGGACGTACA ATTCCAATCG GTATATTTAC CGCATCTAAT AAATAAAATG 2 94 0 

CATTCATTAA ATCCATTTCT TTATCATTGT TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 3 000 

50 

TTCTCACAAA GCGCTCAGTT GAAGTAAATC CACCTGGCAA TCCAAATGTA CCTGCTTCAT 3 06 0 

TGCCTAAAGG TTCAATCGTT ACACCTTCCA AT AAATTTG C TGTTGCTGGA TAAGGAGAAA 3120 
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CACCAATAGG ATTATCTTTT ATAACCACTT 

GTCCAGTTGC ATCGGAAACA TGATAATGCA 

5 ATACAGCTAC AACATGTATT TGGGATGCTT 

ATCCCAAAAT CCATGTCACA ATTTCATTTT 

TTGATCCATA TGAACTATAA CCTCGGAAAT 


TAACACCATC ACCATAAATA AAACCTTCCA 

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT 

GCGAACCATC TAATGGATAA TCATAATCCA 


AAGTTTGTAT TGTGAATCCT GTGCACATTG 

AATCACTTTC AAATAAAGCT GTTTCACTTA 

20 AATTAATTGA TATTCATTCT CAATAACTGT 

GTGTGAGTGA ACGATCATCT ATACATCCGT 

TTTTTACAAG GAGTGACATC TGTGAATAAC 

25 ATTGCGATTA TGTTGTCTGC ATT AA CAT AT 

GGACCTCTCG TTGGTCAAAC ATATCAAACC 

TTAACTTCCT TCGCCACAGG TATCTTCATG 

30 

GGACAACTGA GAATGACATA CATGGGTCTC 

ATTATATCGG ACATCACTGC ACTGCTCATC 

GCTATCTTGT TACCTTCAAC AGTTGGCGTG 

35 

AGACGAGCGA TTAGTTATCT AATGATTAGT 

ATCGGCGGTT TAATTGCCTC AAATTTCGGA 

40 ATTGCTTTCA TTGCCATATT GCTTCTAAAA 
CACCGTCATC CATTCGATTA CAAAGGTATG 

ACATTATTGT TAACACAAGG ATTCGAACAA 

45 

CTGAGCATTT TTATCaTCAc TACGTTGATA 

CCTTTTATTG ATTTCTCAGT ATTACGCAAC 
TTTGTTTTAA ATAGCGGTCT AGGCGTAACA 


CTTGGTTTAt CAgcTGCGCA ATCTGGACTT 
GCGATGATTC GTTTAGGTGA AAAAGCAACA 
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CACCCTCTTT AAATGAAACT TCGACTGTAT 324 0 

ATGGCGGAAC TTCACCGATG TCATTTAAAT 3 300 

GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3 3 60 

GCGTAATATT CATCGCGTCC GCTTTGTGTG 34 20 

ATTGTGTTGA AATGGCAACG CcATGTtCAT 34 80 

TATCTGTTCC TGTGCCAATA AAGCCATATT 3 54 0 

TCCAACGATA ATTTCTAGGC GTCACTGCTG 3 600 

TCGTGCGTCC AAGAAGTACT TGATTATTTA 3660 

TTCTCACTCC TCTGTACCTT CATTTACTTT 3 72 0 

AACATACTAT AAAAAATCAA TTATACAAGC 37 80 

GGTATGATAT GTAAGGAAAT CATGACTTAT 3 84 0 

TCACTTCATC TCATGACTTT CTATATTTAA 3 90 0 

ACACAATCTT CACCACGCAG TAATATTATT 3 96 0 

TGGTTGTTTG CACAATCATT TATTAATATA 4 02 0 

TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4 080 

GTGGCTGCAG GTGATATTGC TGATAAAATA 414 0 

ATAATCAGTA TGTTTGCATC TCTTCTATTA 420 0 

ATCGGTAGAA TTTTACAAGG TCTATCAGCA 4260 

TTAAATAATC AATTTAAAGG AGAACATTTA 432 0 

ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 43 80 

TGGCAAACGA ATTTCATCAT TAGTATAGTC 444 0 

GGCACACCTG AAAAAGTAAG TCAACATAGC 4 50 0 

TCGATTTTCG CTGTTATGAT TGGTAGCTTT 4 56 0 

GGTTGGTTTA GTACATTTTC AATCATTTGT 4 62 0 

TTCATCATCA TCGAACGTCG ACATGAAGTA 4680 

CGTCCGTTCA TTGGTGCATT TTTAAATAAC 4 74 0 

GTGGTCTTTT TCATATATGC TCAAACACAC 4 8 00 

GTTACATTGC CATATGCCAT TGTGGCAGTT 4 86 0 

TTACGTTT CG GTGGCAAATT GATGCTCATC 4 92 0 
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TCACAATATG TCATTGCAGT TATCATTGGT TTCGTCATAT GTGCGATAGG TAATGGTTTA 5 04 0 

GTCGCAACAC CTGGACTTAC GATTGCAATT TTCAGTATGC CTAATGAAAA AGTTGGTTTA 5100 

GCTACAGGAT TATATAAAAT GAGTGGTACA TTAGGTGGCT CCTTTGGTAT AGCACTAAGT 5160 

ACTACAGTTT TCAGTATGTT ACAACTAAAC TATGCACCAA GTGTAGCTGC AACCGTAACA 522 0 

TTTATAGTCA GCATTGTATT GATGATCCTT GGCTCATTGT CTGCATACAT GATCATTCCA 528 0 

AAAACAGTTA AATCTTAAAT ATAATAGAAG AATTATGTTT CGAAATATCT TTATCACTTT 534 0 

AAAATGATAT ACAAGAAATC CAAGAAAAAT AAGCGAACTG AATAAATAAA GATTCAATTA 54 0 0 

ACGCATCAGT ATTAGGATTC ACTCTAAAAC GATTAATAGT TTTATAAGAA GGTGTTTGAT 54 6 0 

CTTGAGCTAA CCACATCATT CGAATACTGT CATGAAGTAA TTTCTCTATT CTACGACCAG 552 0 

AAAATACAGA TTGAGTATAT GCATATAAGA TGATTTTTAA CATCATCTTT GGATGATAGG 5580 

20 ATGTTGCGcC ACGATGATGT CTGAATT CAT CGAATTTGCT ATCAGGTATC GTTTCAACAA 564 0 

TTTCATTAAC ATGTCGCGAA ATATCATTTT GAGGAATTCT AACAGAAGTT TTTATTGGTA 57 0 0 

GTGTAAGTTG GGCAAAGTGT CTTATTTTTT TAAAGTATTT CAAAGTAAAA TTACATGTTA 57 60 

25 ATACGTAGTA TTAATGGCGA GACTCCTGAG GGAGCAGTGC CAGTCGAAGA CCGAGGCTGA 5 82 0 

GACGGCACCC TAGGAAAGCG AAGCCATTCA ATACGAAGTA TTGTATAAAT AGAGAACAGC 58 8 0 

AGTAAGATAT TTTCTAATTG AAAATTATCT TACTGCTGTT TTTTAGGGAT TTATGTCCCA 5 94 0 

30 

GCCTCTTACT CTAATTATAT TCACTATCAA TTAGACAAAA TGGCCATTTT CAAATATCAC 6 000 

GCGTTGTTTC TGACCTTGAA TATATTTATT ATAATTCTCT TTTTGAAAAT CAGTTAACAT 606 0 

TAATTTAGAT GTACCGTATT TTAACACTTT TTGCATTGTT TCTATTCTCA TTTTTCTAAA 6120

TAACCATCCA TCTTTTAACA CAATACGATT AACAGCATCA TATGATAATT CTACTGTTTC 6180 

TTTAATTTCA AATGTCTTGA ATGAAATAAT CGTGCACATT AAAAACGTAT CACCAAAGTA 6240 

40 ATAAACATCT AAATCATCAC GTTTATGTTG TCCAACTAAC AAACGACCAT ATTCGAACTC 6300 

TTTTTCTGGA TATTTCAATT CTAAAAAACT AATAATCTCT TCTTCTTTTA ATTTGAATTG 6360 

CATTTAAAAA CATCCTCTCT TAAGTTTTAA CAAGCCTTAA TTAAAAATTT TTTCAATCAC 6420 

45 

ATAGTTCAAT ATACATCATT TCGTTATGTT TTTTAATACT TTGTTCAAAA ACAAATATTT 64 80 

TATTCTTTAA AATAATGACT TTTGTATTTT TAATCACAAT AAACATTTTA AAATTCTTGT 654 0 

TATCATAATC ATTAAAAGGT ATTAACCTTA ATAATATATT CTCTCGTCTC AACCTTAATC 6 6 00 


GTATACTTCA GACGTCTGTT TGTAGACAAT AAAAGTCATT CACGTCTTCA TATGTCATCA 6 66 0 

AATGTTTATC ATGATATGAT GAATATAATA ATCGGGTATA TAACTGTATG ATTAATTACA 6 720 
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TGGTGTATCA GTTACAACTG TGTCACATAT TTTAAATCAT AATGATAGTC GTTTTTCCGC 684 0 

AACAACGATA AAAAACGTAC ATGCTGTTTC AGAACGTTTA GGCTATGCCC CTAATAAACA 6 90 0 

TGCAAAACAA TTGCGCGGCA GTAAAATTCA AACTATTGGC GTCATTTTGC CTAGCTTAAC 6 96 0 

AAATCCGTTT TTCTCAGCAC TGATGCAAAG TATTCATGAC CATAAACCAT CTGATGTTGA 702 0 

TTTATGCTTT TTAACATCTA CAGCAACTGA TTTGTATGAC AATATTAAAC ATTTAATTGA 708 0 

TCGAGGTATT GACGGATTAA TTATCGCACA ATACATATCA TCCCCGGACG CCCTAAATAA 714 0 

CTATCTAAAG AAACATCATG TACCTTATGT CGTACTGGAT CAAAATGACC ATCAAGGCTA 72 0 0 

TACAGATTTT GTTCGGACAA ATGAATATCA AGGTGGACAA CTTGCAGCAC AACATTTAAT 7260 

AGAACTCGGT CACAACCATA TGATAATTGT TGCACCATAT GACATGATGG CGAATATGTC 732 0 

GACTCGTGTC GCTGGATTTG TCGATACTTT GCGCGCGAAT CAATTGCCAG AACCACAAAT 73 80 

20 CGTCCATACT GAATTATCTA AGCGCGGTGG GCTAACCATT GTTGATGACA TCATGGTTCA 744 0 

ATCTGCCACT GCAATCTTCG CTATTAACGA TGAACTCGCT ATTGG CATTT TACGAGGACT 750 0 

AATTGAACAT GGCATCAGTA TCCCGAAAGA TATCTCATTA ATAGGTTATG ACGACATTGA 7 56 0 

25 TTATGCAGCG TACGTCTCGC CACCTTTAAC TACTGTGGCA CAACCTATAA CTGATATTGG 762 0 

CAAAACATCT TTAACCTTAT TACTTCAACG ATTACAGCAC TTAGATAAAT CCATTGATAT 76 8 0 

GATTGAATTA CCAACGACTT TAAAAATTCG TGCAACAACT GGCTATCATC TTTCAAACTA 774 0 

30 

ACTACGTATC TTCCGAAATA TACTCATCAT TGTTAGGCCC TTAGCGTTGC TTTAATGCTG 78 00 

AGGGTTTTTA AT CAT AATTA TTTTACTAAG AAATTAAAAT AATAATGTAT GAATTTTTAA 786 0 

ATATGATTTA AACGTTTTCA GTTTTTATGA AAACGCATGC ATTTTACAAA TAAAAATGGT 7 92 0 

35 

ACGATGGCAC TGGTAAAACG TTTTACTAAA AACAAATCAT GAGGTGTATA ACATGAGCAT 79 8 0 

TGTTGCATTA CTTATCGGGT TAGGCCCCTT AATTGGCTGG GGCTTCTTCC CAACAGTCGC 804 0 

40 TTCAAAGTTT GGTGGTAAAC CTGTACATCA AATTATCGGT GCTACTGTAG GTACGTTAAT 8100 

CTTCGCTATT TTATTAGCCG TAGTCACATC AAGTGGCTTC CCTACTGGAA CCAATTTGCT 816 0 

ATTCGCCTTA TTATCAGGTG CAGGATGGGG ATTCGGACAA ATCATTACAT TTAAAGCGTT 8220 

45 CGAATTAGTC GGCTCATCTC GTGCCATGCC AGTCACAACA GCATTCCAAT TATTAGGCGC 82 80 

ATCTTTATGG GGTGTCTTTG CATTAGGAAA TTGGCCAGGC ATTGGTCATA AAATCATTGG 834 0 

ATTTACAGCT TTAGTCGTTA TTCTAATTGG AGCGCGTATG ACAGTTTGGA GTGAACGCAA 84 0 0 

50 

AGAAGCAAGT AACGCCAAAA ATTTACGTCG TGCAGTGGTA CTTCTGTTAA TTGGTGAATT 84 6 0 

TGGATACTGG TTATATTCAG CTGCACCGCA AGCAACTTCT ATTGATGGCC TAACTGCCTT 8 520 
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25 
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AGCAGAGAAT 


CCATTCCGTA 


ATAAAATTAC 


GTGGTTACAA 


ATTATTTCAG 


GTTTCTTCTT 


8640 


TGCATTTGGT 


GCTTTAACAT 


ATCTTATTTC 


AGCACAACCT 


AATATGAATG 


GTTTAGCAAC 


8700 


TGGATTTATT 


CTTTCTCAAA 


CATCCGTTGT 


GCTTGCTACA 


TTAACTGGTA 


TTTATTTCTT 


8760 


AAAACAACAT 


AAAACGTCAA 


AAGAAATGGT 


TATTACAATC 


ATCGGCTTAG 


TACTCATTTT 


8820 


AGTAGCCGCT 


TCTGTTACAG 


TATTTATAAA 


ATAAGGAGTG 


TAGATGTCAT 


GAAAAAATCA 


8880 


GCTGTTTTAA 


ATGAACATAT 


TTCAAAAGCA 


ATCGCGACAA 


TTGGTCATTT 


TGATTTATTA 


8940


ACGATTAATG 


ACGCTGGCAT 


GCCAATTCCA 


AATGATCATC 


GTCGTATCGA 


CCTAGCTGTA 


9000 


ACTAAAAACT 


TACCACGCTT 


TATTGATGTC 


TTAGCTACAG 


TGTTAGAAGA 


AATGGAAATC 


9060 


CAAAAAATAT 


ACTTAGCAGA 


AGAAATAAAA 


GAACATAACC 


CTACACAATT 


GCAACAAATT 


9120 


AAACAATTGA 


TTTCATCGGA 


AATCGAAATC 


ATTTTCATTC 


CTCACGAAGA 


AATGAAAAGT 


9180 


AACTTAGCTC 


ACCCATTAAA 


TAAAGGTAAT 


ATTCGTACTG 


GTGAAaCAAC 


GCCCTACTCT 


9240 


AATATTGCAT 


TAGAATCGAA 


TGTTACTTTT 


TAAAAGTTAT 


AACTTGAAAG 


GAGCGTACAC 


9300 


ATGACCAACA 


AAGTTGTTAT 


TTTAGGTTCA 


ACGAATGTCG 


ATCAATTTTT 


AACAGTTGAA 


9360 


AGATATGCAC 


AACCAGGCGA 


AACATTACAT 


GTTGAAGAAG 


CACAAAAAGC 


ATTCGGCGGA 


9420 


GGTAAAGGTG 


CCAACCAGGC 


TATTGCCACT 


GCACGCATGC 


AAGCAGACAC 


AACATTTATT 


9480 


ACTAAAATTG 


GCACTGATGG 


CGTTGCTGAT 


TTCATCTTAG 


AAGATTTTAA 


AGTAGCTCAT 


9540 


ATTGATACAT 


CATATATTAT 


CAAAACAGCT 


GAAGCAAAAA 


CGGGCCAAGC 


CTTTATCACT 


9600 


GTGAATGCAG 


AAGGACAAAA 


CACCATCTAT 


GTTTATGGTG 


GTGCGAATAT 


GACGATGACA 


9660 


CCTGAAGATG 


TTATTAACGC 


AAAAGACGCT 


ATAATCAATG 


CAGACTTTGT 


CGtTGCACAA 


9720 


TTAGAAGTAC 


CCATCCCGGC 


TATTATATCT 


GCATTTGAAA 


TTGCCAAGGC 


ACATGGTGTG 


9780 


ACGACAGTAT 


TAAATCCTGC 


ACCAGCGAAA 


GCATTACCTA 


ATGAATTATT 


ATCATTAATC 


9840 


GATATTATTG 


TGCCAAACGA 


AACAGAAGCC 


GAATTGTTAT 


CTGGGATTAA 


AGTAACTAAT 


9900 


GAACAATCTA 


TGAAAGACAA 


TGCCAATTAC 


TTTTTATCTA 


TAGGCATTAA 


GACTGTTTTG 


9960 


ATTACGCTAG 


GTAAGCAAGG 


TACATATTTT 


GCTACTAAAA 


ATCAAAGCCA 


ACACATCGAA 


10020 


GCTTATAAAG 


TAAATGCGAT 


TGATACAACT 


GCTGCAGGCG 


ACACA x n A I 






GTCAGTCGCT 


TAAACAAGTC 


G CAAG ATAAC 


TTAGCAGATG 


CTATTGATTT 


TGGTAATAAA 


10140 


GCGAGCTCAC 


TCACTGTACA 


AAAACACGGC 


GCG CAAG CAT 


CTATTCCTCT 


ACTAGAAGAA 


10200 


GTAAATCAAG 


TTTAAATGAA 


TCAAACACAG 


CTATGATATG 


AAGGTTT AG C 


ATATAACATG 


10260 


CAACATTCGT 


ATATCATGGC 


TGTGCTTTTT 


TATCTTTATA 


AAACATCATC 


TATTAGAAAT 


10320 
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TTTGTAATCT TTTTAACTTC CAAATTaTCG CATATAAATA TGCTATATTA ATGATAATAA 1044 0 

TTATCAATTA AAAGGAGGTT ATGCTATGTC TAAAGAAGCT GGTCATACAT TTTTAGCTAA 10 500 

5 ATTAGGAAAA ACTCGTCTAC GCCCCGGTGG TAAAGAAGCA ACAGATTGGT TAATACAACA 1056 0 

AGGGGCATTT TCACAAGATA AACAAGTGTT AGAAGTGGCA TGTAATATGT GCACAACATC 10620 

TATTTATCTA GCTCATACAT ATGGCTGTCA CATTCAAGGC GTTGATATAA ATAAGAAAGC 10680

10 

ATTAGAAAAA GCACAGGAAA ACATTTCAGC AGCAGGTCTT GAATCATATA TTCAAGTTCA 10 740 

ACAAGCGAAT GCTGTTAAAT TGCCCTTTGA TGACAATCAA TTCGATATCG TTTTAAATGA 10 800 

15 AGCAATGTTA ACAATGTTAC CCATCGCCAT AAAGGAAAAA GCATTACGCG AGTACTACCG 10860 

AGTCTTAAAG CCTGGGGGTA TCTTGTTAAC ACATGATATT GTCATCGTTA ATGAATCACA 10 920 

TGCCACACAT GTTGTTAAAT CATTATCTGC AGCAATTAAT GTCAATGTCT CACCGCAGAC 10 980 

20 GAAACTTGGC TGGTTAGATT TATATAATCA AGCTGGTTTT AATCATGTGC ATTATCATAC 11040 

TGGTCCAATG AGTTTAATGA CACCAAAAGG TTTAATTTAT GACGAAGGTA TTGTTGGAAC 11100 

TATAAAGATT ATCAACAATG CTTTGAAAAA AGAAAATCGA CCAATGTTTT GTAAAATGTT 1116 0 

25 

TAAAACGATG ACTAAATTGC GTAAAGATAT GAATTATATT ACTTTTGTCG CTAAAAAAGA 11220 

GCACTAAATA TAATGCCACT AACTGTACTT TGTATCTATG TTTGACTATC ACTTTAATTT 112 8 0 

CTTTGTGACA CTAATCATCT ACTTAACAAT ATCGTTATCG TTGATTAGTA AGTCATCAAT 11340 

30 

TTTGG TT AAA GACTTTCATA AACACTCAAA CA'XTAACACT ATA CAT AG TT AGTGG c ATT A 114 0 0 

TTTTTTyCTn AAAATTTTAA CmTCmCGGGr TtGGGAmCrG AAaTGrtAwT TcGCrmAAtT 114 6 0 

35 TAwTcT 114 6 6 

(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:

TTACATAGTT AACACTAGTT AATCTATTAG TTAACATTAG TTAATAATTA GTTAATTTCC 6 0 

ATTTGTATTC TCATGTGATA AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 12 0 

50 

AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCT AGTTGGGGCA GCAACAATTA 180 

CGTTAGCTAC AATGATTTCA AATGGGGAAG CAAAAGCGAG TGAAAACACG CAACAAACTT 24 0 
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AAGTATTACA TCTAAAAGGT ATCACAGAAG AACAACGTAA CCAATACATC AAAACATTAC 360 

GCGAACACCC AGAACGTGCA CAAGAAGTAT TCTCTGAATC ACTTAAAGAC AGCAAGAnCC 420 

CAGACCGACG TGTTGCACAA CAAAACGCTT TTTACAATGT TCTTAAAAAT GATAACTTAA 480

CTGAACAAGA AAAAAATAAT TACATTGCAC AAATTAAAGA AAACCCTGAT AGaAGCCAAC 540

AAGTTTGGGT AGAATCAGTA CAATCTTCTA AAGCTAAAGA ACGTCAAAAT ATTGAAAATG 600 

CGGATAAAGC AATTAAAGAT TTCCAAGATA ACAAAGCACC ACACGATAAA TCAGCAGCAT 660 

ATGAAGCTAA CTCAAAATTA CnTAAAGATT TACGTGATAA AAACAACCGC TTTGTAGAAA 72 0 

AAGTTTCAAT TGAAAArGCA ATCGTTCGTC ATGATGAGCG TGTGAAATCA GCAAATGATG 780 

CAATCTCAAA ATTAAATGAA AAAGATTCAA TTGAAAACAG ACGTTTAGCA CAACGTGAAG 84 0 

TTAACAAAGC ACCTATGGAT GTAAAAGAGC ATTTACAGAA ACAATTAGAC GCATTAGTTG 900 

CTCAAAAAGA TGCTGAAAAG AAAGTGGCGC CAAAAGTTGA GGCTCCTCAA ATTCAATCAC 96 0 

CACAAATTGA AAAACCTAAA GTAGAATCAC CAAAAGTTGA AGTCCCTCAA ATTCAATCAC 1020 

CAAAAGTTGA GGTTCCTCAA TCTAAATTAT TAGGTTACTA CCAATCATTA AAAG ATT CAT 108 0 

TTAACTATGG TTACAAG T A T TTAACAGATA CTTATAAAAG CTATAAAGAa AAATATGATA 114 0 

CAGCAAAGTA CTACTATAAT ACGTACTATA AATAC CAAGG TGCGATTGAT CAAACAGTAT 12 0 0 

TAACAGTACT AGGTAGTGGT TCTAAATCTT ACATCCAACC ATTGAAAGTT GATGATAAAA 12 6 0 

ACGGCTACTT AGCTAAATCA TATGCACAAG TAAGAAACTA TGTAACTGAG TCAATCAATA 132 0 

CTGGTAAAGT ATTATATACT TTCTACCAAA ACCCAACATT AGTAAAAACA GCTATTAAAG 13 80 

CTCAAGAAAC TGCATCATCA ATCAAAAATA CATTAAGTAA TTTATTATCA TTCTGGAAAT 144 0 

AATCAATCAA AAATATCTTC TCTAGTTTTA CATCATTTTT TAAATAATTT TCGTAACAAA 15 0 0 

CCGTGATTAA AAAGAACCGT TGATTCTCAA TCGAATCTAC GGTTCTTTTT TCATTTTCCA 156 0 

40 TCAATTAAAT GCTTCTTCGC TATTTGTCAG CCCACTTTTT TACCTGCAAC TTGTTAAATA 16 2 0 

ATCCTTACAT CGTTAACGAA TAGTTCATCA TTTAGTTGAA TCAGCTCAAC TTTATTAACT 16 8 0 

TCATATTTTC ACAAACTATT GCGCAATCCA TTCCTTTTCC ACTACAAGCA CCATAATTAA 174 0 

ACAACAATTC AATAAAATAA GACTTGCAAA GCATAGTTAT GTAGCTATAT AAACGCCTGC 180 0 

GACCAATAAA TCTTTTAAAC ATAACATAAT GCAAAAACAT CATTTAACAA TGCTAAAAAT 186 0 

GTCTCTTCAA TACATGTTGA TAGTAATTAA CTTTTAACGA ACAGTTAATT CGAAAACGCT 192 0 

TACAAATGGA TTATTATATA TATGAACTTA AAATTAAATA G AAAG AAAG T GATTTCTATG 1980 

ATTAAAAAtA AAATATTAAC AGCAACTTTA GCAGTTGGTT TAATAGCCCC TTTAGCCAAT 204 0 






EP0 786 519 A2 



CmTyCAAArG AcACaGACAT TACTAGCCAA CGATTTAGCT ATnACTCCAA ACCTTCCATT 216 0 

GGATTTGGTA AAGGnT 2176 
(2) INFORMATION FOR SEQ ID NO: 446:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:

15 

AAAAGCATGG CTTAAATGAA GTACGCTATA ACAAATTACA AGAACATGCT ATTGTTATGC 60 

ATCCGGCACC TGTGAATAGA GGAGTAGAAA TACAAAGCGA TTTAGTAGAA GCTTCAAAAT 12 0 

CAAGAATTTT TAAG CAAATG GAAAATGGCG TTTACTTAAG AATGGCAGTC ATTGATGAAT 180

TATTAAAATA GGTAAGGGGA CGAAAATGAT GAAATTAATT AAAAACGGTA AAGTATTACA 24 0 

AAATGGCGAA TTACAACAAG CAGATATTTT AATTGATGGT AAGGTAATTA AACAAATTGC 3 00 

25 ACCTGCAATT GAACCAAGCA ATGGTGTTGA CATCATAGAT GCGAAAGGTC A CTTTGTG T C 360 

ACCTGGATTT GTCGATGTTC ATGTTCATTT ACGTGAACCT GGTGGTGAAT ATAAAGAGAC 420 

AATTGAAACT GGTACTAAAG CTGCTGCTAG AGGCGGATTT ACAACTGTAT GTCCAATGCC 480

30 TAACACAAGA CCGGTACCAG ATT CTG TAG A ACATTTTGAA GCTTTACAAA AATTAATCGA 54 0 

TGACAATGCT CAAGTACGTG TATTACCTTA TGCTTCAATT ACAACACGTC AATTAGGTAA 6 00 

AGAATTGGTT GATTTCCCAG CACTAGTAAA AGAAGGTGCC TTTGCGTTTA CAGATGACGG 660 

35 

TGTAGGAGTA CAAACTGCAA GCATGATGTA TGAAGGCATG ATTGAAGCTG CAAAAGTAAA 720 

CAAAGCCATC GTAGCACACT GTGAAGATAA TTCATTAATC TATGGTGGTG CAATGCATGA 7 80 

AGGGAAACGC AGTAAAGAGT TAGGTATACC AGGTATTCCA AACATTTGTG AATCTGTTCA 84 0 

40 

AATCGCAAGA GATGTACTAT TaGCTGAAGC AG CAGGTTGT CATTATCATG TATGTCATGT 900 

TTCTACTAAA GAAAGTGTTA GAGTCATTCG TGACGCTAAA CGCGCAGGCA TTCATGTTAC 960 

45 AGCTGAAGTT ACACCACACC ATTTATTGTT AACAGAAGAT GATATTCCTG GTAATAATGC 1020 

CATTTATAAA ATGAATCCAC CATTGAGAAG TACTGAAGAT AGAGAGGCTT TGTTAGAAGG 10 80 

GTTACTAGAC GGTACAATTG ACTGTATCGC AACAGAcCAT GCACCACATG CACGTGATGA 114 0 

AAAAGCACAA CCAATGGAAA AAGCaCCATT CGGAATTGTT GGTAGTGAAA CAGCATTCCC 1200

ATTATTATAT ACGCATTTTG TAAAAAATGG TGATTGGACA TTACAACAAT TAGTAGATTA 126 0 
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TTATGCAGAT TTAACAATCA TTGATTTAGA TAGTGAACAA GAAATTAAAG 



GAGAAGATTT 



1380 



CTTATCAAAA GCAGATAATA CACCATTTAT CGGCTATAAA GTTTATGGAA 



ATCCGATCTT 



1440 



AACAATGGTT GAAGGCGAAG TTAAATTTGa GGGGGATAAa TArTATGCAA 



gCAAACGTTA 



1500 



TCTAGTGTTA GAAGACGGTC TTTTACGAGG CTACCGTTAG GTCTGATAAC 



TTACTGA 



1557 
(2) INFORMATION FOR SEQ ID NO: 447:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 447: 

GCTAGAAATm TTGmATGaCA ATACAACTCT GTTAAAaTGA TGGACGTAGA CAAATATGCG 6 0 

TATTGACGCT TTATTTTAAA AATTaACATG CTTATAACAT GTTTATAGAA GGAGATTAAC 12 0 

CTATGAACTA TCaAGTTCTT TTATATTATA AATATATGAC GATTGATGAC CtGAACAGTT 18 0 

TGCTCAGGAT CACTTAGCCT TTTGTAAAGC ACACCATTTA AAAGGTAGAA TTCTTGTTTC 24 0 

TACAGAAGGT ATT AACGG CA CATTATCTGG TACAAAAGAA GAAACCGAAC AATATATGGC 3 00 

ACATATGCAT GCCGATGAAC GATTCAAAGA TATGGTGTTT AAAATTGATG AAGCTGAAGG 36 0 

ACATGCTTTT AAGAAAATGC ATGTACGTCC TCGAAAAGAA ATCGTTGCTT TAGATTTAGA 420 

AGATGACGTC GATCCAAGAC ACACAACTGG CCAATATTTA TCACCTGTAG AATTTAGAAA 4 80 

AGCTCTTGAA GATGATGACA CAGTCATTAT TGATGCACGT AATGATTATG AATTTGATTT 54 0 

AGGTCATTTC CGAGGTGCAA TTCGTCCAAA TATCACACGT TTTAGAGATT TGCCTGACTG 6 00 

GATTAAAGAG AATAAAG CGT TATTTGCAGA TAAAAAAGTG GTTACGTACT GTACTGGTGG 6 60 

CATTCGATGC GAAAAATTTT CTGGATGGCT TTTAAAAGAA GGTTTCGAAG ATGTAGCTCA 72 0 

ACTTCATGGC GGTATTGCTA CATATGGTAA AGATCCTGAA ACAAAAGGTG AATATTGGGA 780 

CGGTAAAATG TACGTATTTG ATGACCGTAT CAGTGTTGAT ATCAACCAAG TTGAAAAnAC 84 0 

AATTATTGGT AAGGATTGGT TTGATGGCAA ACCATGTGAA CGTTATATTA ATTGCGCTAA 900 

CCCAGAATGT AA T AAA C AAA TATTAGTTTC TGAAGAAAAC GAAACTAAAT ATTTAGGTGC 960 

ATGCTCTTAT GAATGTGCTA AACATGAGCG TAATCGTTAT GTTCAAGCAA ATAATATTAG 1020 

TGATAATGAG TGGCAACAAC GTTTAACAAA CTTTGATGAT TTACATCAAC ATGCTTAGTT 1080 

TTAATTAAAT ACCTTTCAAA ACACGCTTTG AAAATCCGAT TTATAAAGGT TTTTCAAGGC 114 0 
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TAAATTTTAA TACTGCGGGG TGTCTTAAAA TGCACATTTT AGTAACAGGG TTTGCGCCTT 12 6 0 

TTGACAATCA AAATATCAAT CCCTCATGGG AAGCTGTGAC TCAACTAGAA GATATTATTG 132 0 

GCACACATAC AATCGATAAA TTAAAACTAC CAACCTCTTT TAAGAAAGTA GATAATATTA 13 80 

TAAATAAAAC GTTGGCATCT AATCATTATG ATGTTGTACT AGCTATAGGA CAAGCTGGTG 14 4 0 

GTAGAAATGC CATTACCCCA GAACGTGTCG CCATTAATAT TGATGATGCA CGTATTCCAG 1500 

ATAATGATGA TTTTCAACCT ATTGATCAAG CCATTCACTT AGACGGTGCG CCAGCTTATT 156 0 

TTTCAAATTT ACCaGTTAAA GCAATGACTC AAAGTATTAT TAATCAAGGA CTTCCTGGAG 162 0 

CACTTTCAAA TAGCGCAGGT ACATTTGTTT GTAATCACAC ACTTTATcAC TTAGGTTATT 16 8 0 

TACAAGATAA GCATTACCCT CACCTACGAT TCGGATTTAT TCaTGTGCCA TACATACCAG 174 0 

AGCAGGTcAT TGGTAAACCC GATACACCAT CTCATGnCCA TTGAGGAAAA GATnAGTTG 17 9 9 
(2) INFORMATION FOR SEQ ID NO: 448: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 

ACTTGGTTTT TTATTGTTTA TAAATAAAAC TCACTTAATA ATGTTTT CAT AATCTTCTTC 6 0 

GACTACTTAA TTCTTTAAGA TATTCGTGAA AAGAGACATT ACACTAGTTA ATTTTCAAAC 12 0 

AATACAAAAA GCGTCTACCT CCTACATATA ATTGTAGCGG AGATAGACGC TTAATATTTA 180 

TTTAAAAATT ATTTTAAACC ACCGAATGTC ATAACATCAC GGGCAATCAT ACTTTCTTCA 24 0 

TCTGTTGGAA TAACGACAAC TTTAACTGGT GAATGAGGAT AGTTAATAAA TCCTTCTTTA 300 

CCACGTAGTA AGTTTTCATT TTTCTTAGGA TCCCAGTAAA CAC CCATAAA TTCTAAGCCT 36 0 

TCAAGAACTT TCGCACGAAT TTCTACTGAG TTTTCACCGA TACCTGCTGT AAATACGATA 420 

ACATCAACAC CATGCATTCT CG CAG CAT AT GATCCAATAT ATTTGTGAAT TTTAGAAGCA 4 80 

AATACATCTA AAGCCATTTG TGAACGTGCT TTACCTGATT CAGCTTCTTC TGATAAGTCA 540 

CGTAAATCAC TAGATGTACC TGATAATCCT AATAAACCTG ATTCTTTGTT TAAGATTTCC 600 

AATACTTGTT CAGCAGTTTT ACCTGTTTTT TCCATAATAA ATGGAATTAA AGCAGGGTCA 660 

AT ATT AC CAG AACGAGTACC CATTGTTACA CCAGCAAGTG GTGTGAAGCC a TTGATG TAT 720 

CAATAGATTT ACCGCCATCG ATAGCTGCAA TTGATGCTCC ATTACCAATG TGACATGAAA 78 0 
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TATGGCTTGT ACCATGGAAA CCATACTTAC GAATGCCATA ATCTTTATAA TAATGATATG 90 0 

GCAAGCTATA TAGATATGCT TTTTCAGGCA TTGTTTGATG GAATGCTGTA TCAAAAATTG 96 0 

5 CCACATGAGG GATATTTGGT AATAATTTAC GGAAAGCACG AATACCCAtC AAGTTaGCTG 102 0 

GGTTGTGaAG CGGTGCTAAT T c G CTTAATT CTTCAATTTC CTTTTCAACC TCATCAGTAA 1080

TAGCTACTGA TTCAGGGAAT TTTTCACCAC CATGTACAAC ACGGTGACCT GTTCCATCGA 114 0 


TATCGTTAAT AT CATT AAT A ATATTGTGCG CTTTAAAAGC ATCCAACATG ATATCAACTG 1200 

CCTCAACGTG ATCCTTGATA TCTTGTACTG TTTTAACTTT TTCCCCGTTG ACTTCAATTG 126 0 

TAAAAATTGA ATCCTTCAAT CCGATTCTTT CTACTAAACC TTTTGTTACT AATTCCTCTT 1320 

15 

CAGGCATTCT AATTAATTGA A 1341 
(2) INFORMATION FOR SEQ ID NO: 449:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1529 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:

T TTTG AAG AT ACTACCGATG AAAATAGACA AAAGATTTTT CAATATTTAT CACCTGAAgA 6 0 

30 AGTTGCAAAT TTCTTTGATC AATTAGATAT TGATGACGAT GAATATGAGT TGCTATTTGA 120 

TAAGATGAAT GCGACATACG CAAGTCACAT ATTAGAAGAA ATGTCATACG ACAATGCAGT 180 

AGATATTTTA AATGAGTTGA CTAAACCAAA AG TTGCTAG T CTTTTAACAT TGATGAATAA 24 0 

35 

AGATGACGCG AATGAAATCA AAGCATTACT TCACTATGAT GAGGATACGG CCGGCGGTAT 3 00 

TATGACGACG GAgTATTTAT CACTTAAAGC GCATACGCCT GTTAAAGAAG CATTATTATT 36 0 

GGTCAAAGCG CAaGCACCAG ACGCAGAAAC AATATATGTT ATATTTGTCG TTGATGATGA 420 

40 

TGGTAAATTA GTAGGTGTTT TATCGCTAAG AGATTTAATT GTAGCTGAAA ATGATGCTTA 4 80 

TATTGAAGAT ATTATGAATG AACGTGTCAT TAGTGTGAAT GTAGCAGACG ACCAAGAAGA 54 0 

TGTTGCTCAA GTTATGAGAG ACTATGATTT CATGGCTGTA CCTGTTATAG ATTACCAAGA 6 00 

45 

ACATTTGCTT GGTATCATCA CGATTGATGA TATTTTAGAC GTTATGGATG AAGAGGCTAG 6 60 

TGAAGACTAC TCTCGTTTAG CCGGGGTATC AGATATCGAT TCGACTAATG ATTCAATCAT 7 20 

50 TAAAACAGCA TTAAAACGTT TACCATGGTT GATTATTTTA ACATTTTTAG GAATGATTAC 7 80 

TGCGACAATT TTAGGGAGAT TCGAAAAAAC ATTAGAAAAT GTAGCGCTAC TCGCAGCGTT 84 0 
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TCGTAACATT ACGAcAGGGG AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 960 

AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 1020 

TGCAATATAT CATCAGCCAC TTTTAGCATT AATCGTTGCA GGAAGTTTAA CTTGTGCGAT 1080 

GACGGTGGGG ACGTTTGTAG GTTCGATGAT TCCATTATTG ATG AAT AAA T TAAAT AT CGA 114 0 

TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 1200 

TTATTTTGGT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 1260 

TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 1320 

AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG TGATTGGAAA 1380 

TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA CGTTAGGCTT 144 0 

TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 1500

ACGCGCACGT CAAGGACAAA ATGATGATG 1529 
(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGCATCCAT ATCACCCAAT 6 0 

ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 120 

GGCAGTCAAC TTCAAAGCCA TGAATCATCA TTTTTTGCAT TGGTGCTTGT ATAAAGTAAT 180 

AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 24 0 

CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTTTC ACTAAACTTC 3 00 

CACTCAATAA AATTAAAGTT ATTATTCCAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 360 

GCGGTCGCAC CTTATTCTTA ATATATATCT TAAATTGATC ATGTGATTTT TCTGTTTTCA 420 

CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 4 80 

TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 54 0 

GTAATTCTAC ATTAACTAAA GAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 600 

CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 660 

CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 72 0 
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CAGCAATATC TATTTGATTC ATTGCTGCAA CAACCTGTTC GTAATGAAAT ATATCACACT 84 0 

GAATCCAAGT CATTTCAACA TCATCTGTTT kTTTATTGTC TGGATATTTT GATATAGCAA 900 

5 AAAGTTCAGC ATCATTTTCA ATCACTTCAC TTAAATACTT ACCAATATAT CCTGTTCCAC 96 0 

CTGCTAATAA AACTTTAGGT TTCATCTAAA ATACTCCTTT AAACTGTAAC CAAAAAACAT 102 0 

ATTACTCCAC CTTTTAGTTA CATATATATT ATAATAGTAG CAAATGTTTT AAAATTTCAA 10 80 

10 AATACTGGAG GCTTTTTATG GCCCATATTA TACGTAGAGT TAGTATCAAA GATGTAGAAA 114 0 

ATTTCATTTC AATGTTAGCG AACATATACG ACGAATCTCC GTATATGTTC TACACACCAG 12 00 

GAGAATATGA TCCTAGCGTC ACATCGGCTA GTAAACAATT AGAAGAATAT ATCACTTCTC 12 60 

15 

CGCATAAAGT CATCTTCGTT GCTGAAAGTG ATGAACAACT CGTTGGCTTT GCCTTTGTTA 13 20 

ATACGACACC ATTTCAACGC ATTAAACATG TTGCTAAAAT TGATTTAGGT GTAAAGAAAT 13 80 

TATATCAACA TCGTGGAATT GGCCAAGCAC TTCTTGATGC CATTATGGCT TGGTGTTTAA 144 0 

20 

ACAATCAAAT ACACCGAATT GAAGCAAATG TACCACTCAA TAACCAACCT GCCCTCGAGC 1500 

TTTTTAAAAG TGCCGATTTT CAAATCGAAG G CG TTTT AAA AGATAAGTTA TTTATCGATG 15 60 

GTAAATATTA TGATGACTAT ATGATGGCTA AAATTCTTAA TTAAAGCTAT TTTATCATAA 1620

TCTTGTATCA GAATCGTATA ACAACGAATT TAATGGTTAC CTAATACATT ACTCATACTT 16 8 0 

ATCAATGTTA TCTAATCTCA AATAAATACG TACACTCTTA TTCATTTATC AAATTTAAAT 174 0 

30 TCAAAATAnA ACACCACTAA TGTGTAATTG ATTAACTATC AACTACGATT AGTGGTGCTT 1800 

TATATATGTG GTTAGTTTTC CTnACTA 1827 
(2) INFORMATION FOR SEQ ID NO: 451:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 

ATATAGATTA ATGTTTGTTA TTTGTACTGT GTTTGCCAGT GACATTTATT ATAAAACATA 60 

45 

TGACGTTATT CTATGkTGTA CTCATTt TCT ATATTGTAGG TTTATTAACG ATTAGAAGTA 120 

TTATTAAAAA GTTGAAATAT CAGGAAACAT TATTACGAGA CTAAAAAACT TCCATTGGCA 180 

50 TGTATGTTGT AAAGGTGCAT GTAATGTTGA ACGCCAAATG ATACGGCGTT CAGATTACAT 24 0 

TAGCATCTAT ACGTTAACAG CATAACCAAT GGAAGTTTTC TTCGAATCTA TTCTTTTATT 3 00 
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AATGCGTCTT TTGAAAAATG GTCATTAAAG GCATCAGATT GCTTAAAGTC TTCGTATGCA 420 

TGTCGATCAG CAAATCCGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 480 

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 54 0 

TCAAGTTGAT ATGCATGATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 600 

TCACtGAATT CACCAA 616 



(2) INFORMATION FOR SEQ ID NO: 452:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:

GCACGAGTGA TTAAACGGTT AATCAATGAA ACATTTGATG CAAATTACAT TGAAGTTATT 60 

GAGGGAGGAA TTGAAGAAAC GCAAACGTTA ATTCACTTAC CTTTTGACTA TGTCTTCTTT 120 

25 ACAGGAAGTG nAAATGTAGG CAAAATCGTT TATCAAGCTG CCAGCGAAAA TTTAGTTCCT 180 

GTGACATTAG AAATGGGCgG TAAAtCTCCa GTCaTCGtTG ATGAAACAGC GAATATTAAA 24 0 

gTTGCTAGTG AGCGCATTTG TTTTGGGAAA TTCACTAATG CCgGCCaAAC ATGTGTTGCa 3 00 

30 CCAGATTACA TTTTAGTACA CGAATCTGTA AAAGATGATT TAATCACAGC CCTATCAAAA 3 60 

ACGTTGCGTG AATTTTATGG TCAAAATATA CAACAAAGTC CAGATTATGG CCGCATTGTA 4 20 

AACCTTAAAC ATTATCATCG TCTGACTTCA TTACTTAACA GTGCACAAAT GAATATTGTA 4 80 

35 

TTTGGTGGTC ATAGTGATGA GGATGAACGT TATATAGAAC CAACATTGTT AGATCACGTT 54 0 

ACAAGTGATT CAGCAATTAT GCAAGAAGAA ATTTTTGGTC CTATCTTACC GATTTTAACG 6 00 

TATCAGTCAT TGGATGAAGC AATAGCCTTT ATTCACCAAA GACCAAAACC TTTGAGTTTA 6 60 

40 

TATTTATTTA GCGAAGATGA AAATGCTACA CAACGTGTAA TAAACGAGCT ATCATTTGGC 72 0 

GGCGGCGCTA TTAATGATAC ATTGATG CAC CTAGCGAATC CTAAATTACC ATTTGGTGGT 7 80 

45 GTTGGTGCCT CAGGTATGGG ACGCTATCAT GGTAAATATT CATTCGACAC TTTTACACAT 84 0 

GAAAAAAGCT ACATTTTCAA ATCnACACGA TTAGAATCAG GTGTCCATTT ACCACCATAT 90 0 

AAAGGTAAAT TTAAATrCAT CAAAGCTTtC tTTAAAnATT AATT 944 

(2) INFORMATION FOR SEQ ID NO: 453:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4820 base pairs 
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(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

TCTCCAGTAG ACCTTGTGTA TGAACAGTTT CTTTCATATG AATGAACATC GTTTTTAAAG 6 0 

TTTGTTTCAA CTCAGCCTTA TCAGGATAAT ATCTAGAGAC AGTCGTCTCT GGCATCTCCA 12 0 

TTGTATGATA TTTAACCTTA TGCAGCGACC CATGATTTTC GTTATAAACA AATGTATGAT 180 

TCACTTCATC GAAATCATGA TCTTCTCCTG CAATCCAAAA AACTGGTACT ACTTGTTGCT 24 0 

TATGTGTATC CGTTAATTCC TTAGATAAAG TAATGATTGa AAATATTTTA TGGAATGTAT 300 

ACAATGGTCC CCCGAAAAGC CCTGCTTGTT GTCCACCAAT CACAACTTTT GAACCATTAG 360 

CTAAATGTTG TATGTTTAAT TCTTGTTCAC TTGAAAGCTT GAAAGCTTTA AATCACTCAT 420 

ATATTCACGA AT AACATT CG CTAATGCCGC TTCTCTTCCA TTATTTTCTT TAGACATCCT 4 80 

TTTTTCAAAA CTAGTTTGTT GAGCTGCATC ATATTGAAAT AATCCTGTTA TTACAGGGTC 54 0 

ACTGCTCTTT ATTTTTGGTA TAAACTGATC TTTTTCATTT AAACTAACTA CTTTACAGTC 600 

CATGTTTTTT CTCCTTAAGT ACGCGATTAC AAACTATAGT ATAAAGTCTA TACCGGTGAT 66 0 

TGACAATTTT ACGGCTTGAA AATCAATTTA AT CAT GG AAA ATTTATAATA TTCATTGTTT 72 0 

TACATTTTCA AATCAATGAA AAACACAAGT GGTTTAATGT ATAATAATAG TAG TAAACAA 780 

30 

ATAAGGGGTA GATAAATATG AGTGAAATCA AACGTCTTGA AATTAATTAC AAAACTGACG 84 0 

AATTATTCGA AAACTTTAGA GCGTTTGGCA ACAAAGACTT ATACATGGTC AATGAGTTAA 90 0 

ACGGTCAAAT GATTGATGCA AGTTCAGATT CACCATTTTA TGGCATATTT GTCGaGATCA 96 0 

35 

ATTAGGAGCT AGAATGGCAT TACTAAAAAA AGGTGATGTC GAAGAAATCT ACTTCCCAGA 1020 

TTTTGAAGAT TATATATTAT TATGGAAGTT AGAAGTATTA CCAAAATATC AAAACAGAGG 1080 

40 GTACGCTTCA GAATTGATTG ATTTTGCAAA GAGTTTCAAT ATGCCAATTA AAGCCATTGG 114 0 

CAGAAATGAT TCTAAGGATT TCTTTTTACA TCATGGATTT ACAGATGTGG AAGCTAAAAA 12 00 

TATAGAGGGA CATGATGTCT TATTGTGGAA AC CATAAG AT AATAATATTC GACACTACGA 126 0 

45 GCATGAAAAT GCATCTTTTC GTAGTGTCTT TTTTACAATT ACTTTCTTAA GCTAATATAA 13 20 

GTaAATCATT TTCAAATTAT TTGTCTTAAC GTACAATATC ATTTAGTTGT TTCCATGrAT 13 80 

TAATTTCATA ATCAGGTATA ATTCCTGGAT TATGATCAAA TCCTCTAAAa TTAAACCAGC 14 4 0 

50 

AAGTAGCTAT ACCcGCATTG ATTCCACCTA GAATGTCAGA TGTTAnAGAA TCTCCaACTA 150 0 

TAATCGAGTG CTGtCTTTCA TCCTCACCAA TATCATTAAA AACATAATTA AAAAATTCCG 1560

ACGGCGTCTG ATTTAACCTT CTCTTTTGCG TTTCGGTTAC ACCATTAGTA ACAATATATA 1680

AATCATGTCT TTTCGATAAT TCGACAATTG TTTCTAATGT TTGATCAAAG TATTTAACTT 1740

TAGCTTCTGC TAATCCATTT CTAAATAACA CATCTGCACG ATGCCCATCA ACTTCCATTT 1800

GATGATGTTT GAAGTAATTC ACAAATCGTT CTGATAATAC TTCAGACTTC GkTAATTTAT 1860

TTTGkTGAAA AGCTTCCCAA TGTTGGTGaT TGaTTTTTTT AAATGkTAAA AAATCATCCy 1920

TTGTTGCTTT ATGATTAAAA ACATTCGCCA TATAGTGAAn CGCCCATTCT TCTGcATCAT 1980

AAAAATCAAC AATTGTATCA TCAAAGTCTA TCAAAATATT TTTATATCCC AATTTCCCCA 2040

TCTCCTATAT TGTCTATGTA TCTAAATCTT AACAGAGGCT CAAATTTCTG CAAATAAAAT 2100

AAACTGAGTG CATAACATTA AAGTATGCTC ACCCAGTTTA TTTTAAAGAA TATTAGTTAT 2160

TATATTAGAA TCCAAATAAT TTACCTAGTA AACCCACACC GTTAGCAACG ATGTCTACGA 2220

TACTTGTGCC TAATTTCACA CTATCATGTT GTTGTGCAGC TTGCACAGTA TTTGCGATTG 2280

CTTCTGCTAG TCCAGTCATT TAAATCTCTC CCTCACCTTT GAAATAATAC TGATTACTTA 2340

CATAACATAT TGAAATTAGA ATCCGAATAA TTTACCTAAT AAACCTACGC CATTTTCAAC 2400

GATGCTCACA ATGCTTGTGC CTAATTTTGC GCCATCATTA TTAATTGCTG CAGTTACGGT 2460

ATCTTTAATT GCGTTAAATA AACCTTCCAT TGAAAACACT CCTTAAAATT TAAATTTGAA 2520

GATAACAAAA ACGTGCGTAg yTTTTAAATC ACCGAAATGT TATTCGCTTA ACGTTTTGTT 2580

GTTGTTATTT TAAAATAAAT TTGATGCAAT TAGTTTGTTT ATCCGCACAA CATCTTATAA 2640

TGTACTTAAC TGTATTTTAA AGAGAAAAGA AATACAGTTA GGCATTCAAA ACTGTATTTA 2700

ACACAATTAA GTTGCCTGAA TTCGTATTTA AGTCTTATTG AACCTTTTTA GATAAATAGC 2760

TCTATAATAG TGAAAAATAT AAACATTTTT TATTTACAAG GTATTGCTAA TTTAAGTTCA 2820

TTTAGATATA ATAATTCTTG TGTTGTTAAA CGTGTCCTGG TAGCTCAGCT GGATAGAGCA 2880

ATGGCCTTCT AAGCCATCGG TCGGGGGTTC GAATCCCTCC CAGGACGTTT ATAGGTATTT 2940

TTATACGCAT TACCAAACAA AAGAGTTCCG TGATTACGGG GCTCTTTTTG TTTTGAATTT 3000

CAGTAATATA GTATGATGCG TCACCAAAAC GTCCCCCGCA TAAGCCCCGA AAATACAGTA 3060

ATTAAAACAA GCATGCTTAT TCGTTATAGA ATTTTTTGAC ACACAATTGA CACGCGTCTG 3120

ACACTTGTTT ATACATTTTT AATTAAGTAA TTTTGTGCTC AAATTTCATC TATACTGCAC 3180

CTGAACTACA CCAACACTAC ACCAAGATTT TTAACACTCA CCATTTGCAT GCGTAGAGAT 3240

TTTTATTATT ATATTATTCC TATAGATTTT GATACTATTC AAAATTTTAG GGACTTTTCA 3300

GGGGCCCGAA ATCCTATAAT TATAATTATA TACATCTAAA AAAAATAACC ACGTCCATCG 3360
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ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 34 BO 

CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 3 54 0 

5 TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTG TACACC 3 6 00 

CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 36 60 

TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 3 72 0 

GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 378 0 

GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAATCAT 3 84 0 

GAGACTTCAA AAAGCACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3 900 

15 

TAATCTATTA AAAGACTTAT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3 96 0 

GATTTGGATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 4 02 0 

AAACAGTCCT CTAGTTTCAT CAGTGTTCAC AACATTTTTC ATGTCTGGCT TTTTAGGTAC 4080

TACTTATTTA AATACATTTT TTAGTAACAT AACTTTTATC AATAGCTTAA TAACGCCTAT 414 0 

TTGGATTTTA TGCCTTGTGG GAATTATGAC G CAT AT G ATT ATTTTTTCAA TAAAATATTT 4 200 

25 AAAAGATTTT TCACTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTTA TTGGTATTGn 426 0 

TATCGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACAGTAAT 4 3 20 

ATATGGCTTT GTAGCTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 4 3 80 

30 TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCACCAT TTTCTTTAGy 4 44 0 

CGCTGCAGCA TATGTTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTA TATTTTT 4 5 00 

ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4 56 0 

35 ACCTTTTTCG CCCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTCAG CAACTGCTTT 4 62 0 

AAAGAACAGT TTGCCTGTAC TTATGTkTCC AGACATTtGG AAAGGkCTTT TGTTTATCGA 4 6 80 

AGTGTTATTA GCCACTGTAA TAGTACTTAG AGT CTTTATA GGATATCTTC ACTTCTTTTT 4 74 0 

40 

AAAAAAGGAA AAACAAGATA AATTTCTnCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4800

ATTAACACGT ATATTTAATA 4820
(2) INFORMATION FOR SEQ ID NO: 454: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4358 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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ATTTGGATCT TTAATATCAC CAATATTTTT AATATCTTCC GGATTCAATC CATATACTTG 60 

TACTGTATCT GAGTATTTAA TTGTGAAATA ATCACCTGAT TTAACTTTGT CATCAACTGT 120 

AATTTGTGAT TTTAATGATA AATAATCTTG GGCTGGTACG ATTTTATTGT TTTTATCTGC 180

ATCAACGACA GTtAATGTTG TATTTGATGT GATTAAATCA TTAACATTTT TAGCCTCTGT 240 

TGATGATGGC TGTACTGCTG CTATACGCAT TCTTGTATTC AAACGTTTAG GTGCTGTACT 3 00 

TTTTGGCAAA ATGATATCTG CATTATTTTC ATTATTTGAA TTACTATTGT TATCAACAAG 3 60 

AGTTTCATCA TTACTCTTGA TAGCATCACT TTTAACATTT AATGTAGTTG ATTCAGTTTT 420 

GGCATCTACC TTTTTGTTTT CCTCATTAGT TGGTTGAACA TTTACCACTG ATTTATTCTC 4 80 

TTGCAAATCA GGTTGTAACG CTTCTTGATT ACTTATAGTT TGTTTAGTGT TTAAATCTTC 54 0 

ATTCGTAGAT TTTGGTGAAG CTTGCTCATC TGATTTGGCA GTTGAAACTT CAACTTTATT 6 00 

TCCAGTGGTA GATTGTACAC TTTCTTTTTC TATTAATTTA TTCCCATTTG AAGTCGTTTC 660 

ATTACCTTGa GATGATACCA TTTCTTTTTG ATTATCATTT TTAGTATTGT CTTCTTGATT 720 

TAGTTGCTGC ATATCAACTT TATCACTCGA TTGATTATCA CTTGCTGAAG TTGTCGCTcG 780

TTCAATTCTT TATTAGTACT TTCTGCAGCC TTTGCTTCTT GGTTCCCCAG ACCAAAAATT 840 

AATGTTGTAC CTACTAAAAT TGATGCTGTT CCCACTGTGT ACTTTCTAAT CGAAAATTTA 900 

TTTAATCGAT TGGATACCAT GCCTTTCCTT GTTATTGCCG TTTTATTTTC TCTGTTTAGC 96 0 

ATTAGATTAC TCCTAATTCA TCAAATTTTT AAATAATACA ATTGTTTTAA ATACAAAAAT 1020

GTATATCAAT ATAGTATTAC ATTTTTAGAT AAAGCACAAT ACTTTAATTA TTTTTCTTTA 1080

TCGTAAAACG TTATTTAACA TTTGTGTTTA AATAAAAGTT TTTATGAGTt TTGTAATCTT 114 0 

TATTTAATCA TCATAAAAAA TAGTATTATT TGCCCTTGAA ATTAATATCT TAGCTTTTCT 12 00 

AATTCATAGA CAATTACATT TCTGTAACAA ATTAAATTGT ATCTATTCCT TAAAGATTTT 1260 

TTGTTTTATA TCTGGGAATT TCTAAACAGA AAAAACCAGG CCACATGGAC CTGGTTAAGT 1320 

TAATCATATT ATTTATTTTG TTTTTTACGA CGACCGAATA ACAATAATGA TCCTAATGCC 1380 

GCGAATAATC CACCGAATAA TGTGCCATTA TTTGAATTAT TATTTTCACT ACCTGTTTCT 144 0 

45 GGTAATGCTT TAGCTGTTTT ATGCTGATCT TTAACCGTAC TCATTGGTTT AGCCGGAGTA 1500 

TGTTTACCTG CATCTGAATC TGAATCGCTA TCTGAATCTG AGTCGTTGTC TGAGTCCGAA 156 0 

TCGCTATCTG AATCTGAGTC GCTGTCTGAA TCTGAATCGC TATCCGAGTC TGAGTCGCTA 1620 

TCTGAGTCTG AGTCGCTATC TGAATCTGAA TCGCTGTCTG AGTCTGAATC GCTATCTGAG 16 80 

TCTGAATCGC TGTCCGAATC TGAGTCGCTA TCTGAATCTG AATCGCTATC TGAATCTGAG 174 0 
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TCTGAATCTG AGTCGCTGTC TGAATCTGAA TCACTGTCTG AGTCTGAGTC GCTGTCTGAG 186 0 

TCTGAATCGC TGTCAGAATC TGAGTCGCTA TCTGAGTCTG AATCTGAATC ACTGTCTGAG 1920 

5 

TCCGAATCGC TATCTGAATC TGAATCGCTA TCTGAGTCTG AGTCGCTATC CGAATCTGAG 1980 

TCGCTATCTG AGTCTGAGTC GCTATCCGAG TCTGAATCGC TGTCTGAGTC TGAGTCGCTG 204 0 

TCTGAATCTG AATCGCTATC TGAGTCTGAG TCGCTGTCTG AATCGCTGTC TGAATCTGAG 2100 

TCGCTATCTG AATCTGAGTC GCTATCTGAG TCTGAATCGC TGTCAGAATC TGAGTCGCTA 216 0 

TCTGATGTTT CTTCTTCGTA GTAGCCATTA TCAAGTGTGA AATCATCATG ATCCGTAATT 2220 

J5 GTTACATCAA CTTCGCCACC ATCGGCATCT TTATCATCTT CAGTTGTATT TGTACCTGTT 22 80 

TGAGTTAAGC CAGCAGGTTT TTCAAAGATA ACTTTGTATT TACCACTATC TAAATTATCA 2 34 0 

AAGCGGTATT TACCATTTTC ATCTGTyTCA GTTGTACCAA TTACTTCGCC TTTTTCGTTT 24 00 

20 TGCAAAGTAA CTTTAACACC TTTAATTCCT TTTTCAGTCG AATCTTGTTT ACCATCTTTA 24 6 0 

TTACTGTCGT ACCAAACATA ATCACCTAAA CTATATTTTG GTGTTTTGTA GAATCCACTA 2 52 0 

TCTAATGTCA TGTTGTCAGC GTCTTTAATG ACACCTGTTG TAGTTAGTCC ATCAGAATCT 2 580 

25 ACAGCATCAT CTGTACCTAC ATTTGCAGTT GTCGGTGTAT AACCGGCTGG TGTTGAAAAC 2 64 0 

TCTACACTAT AAGTTCCATT G CTT AAACCA GTGAACTGAT ATTTACCATT TTCATCTGTT 2700 

GTCGTACGAT CTAATTCTTT ACCGTTACTA TCTTTAAGAA TGACATAAAC ACCTTTAATC 2760 

30 

CCTTTTTCAT TGGCATCTTG TTTACCATCT TTATTTGTAT CTTCCCATAC ATAGTCACCT 2820 

AGATTATATT TCTTTTGGTC GCCATTAGCA GTTGATGAGC CATTCACATT TGAATAACTA 2880 

TTTGACCAAC TATATTTAGT TTTGTCAGTG TCTAAAGTAT AATCAATTTT TCCATTATCT 2 94 0 

35 

GTTGAACTAT TATCTGGATA AGCAACTTGT TGAATGATGT ATTGTTTATT GCTGCTTGTT 3000 

TGGCCTTTCA TTAAATCGAC TGTAGCTGTT TT ATT AT CAT TACTATAAAT AACATCGAAT 306 0 

40 TGATCAGTAA CATCTTTAAG TTTTGAAGTA TCAGGGGTGA AACTATCCAC AAATTGATTT 312 0 

TGATCTGtCA CTt CGTAAAT TTTGAAGTTT TTTGCATTTG GATTAAATTT ATATCCAGTT 3180 

AAATTAGTAA CAAACGTTTG TTTAGTATAT GTATTTTTAG GTTGATTTAC ATATGCAGTC 3 24 0 

45 ATATTACGCG ATAAATCTTC ATTGTTAATA TAGTTTGTAC TTGAAATAAG CGGTTGTGCT 3 3 00 

TTTTTATTAC CATAATCGAC AATGATTTCT TCGCTATATG TATCATTACC TAAAGTTACT 3 360 

TCCATTTTAT AAGCTGTTTT ATCAGTTGTT GCATTTTTAC GTTTCGCAAA TGCAACTTGT 342 0 

50 

TCAAAGCTAC CTCTAACATT TGTATATTGA TCTACATAGT TCGTAAAAGT ATATGTTGTT 34 80 

GTGTTTGTTG TACTATCATA AAT AC CTTTT GCAATAATAT TACCTTGGGC ATTATATAAA 3 54 0 
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w 



GTAAATGTAT CGCCCTCTTT AACAGAATCA TCGATTGTGT AATTTGCTTT TAATTTTAAA 3 660 

ACATCACTTG AAG7TGCCCA AAATTCAGTT TTACCAGTAG TCTGATTAAC ATGTCCTTTA 372 0 

TCAATCGCAA TGTCAATATT TGAAAAATGT ACTTTATCAT TAACATTTGT TCCTTGTTGT 3780

GGAGCTGCAA CAGTATTCAC TGCCATGCGA TTTAAAGTTC TTGGTTTAAT AGTCGTTGTT 3840

TTAGGTGTAG TTGAAACATC TTTTGCTTGT GTTAAATTAC TTTTATCAGT TTCATTACTA 3 900 

TATGTAGTTG ATGATTTATC ATTTGTTGTT ACATTGCTAG TTTTTGTAGT AGATTGATTA 3 960 

GCTGTAGCGT TTTGTGGTGA TTGCATGTTA CTACTAGTTT CTTTAACTGT TGCACTATCA 4 020 

CTCATTGTCA CTTTAGGCTG ATCTGCAGTT GCAGTTTGCG TATTGTCTTT TAGTTGACGA 4 0 80 

CTATCAACTT TTTTAGTTGT TTTATTCTCA CTTGGGGCTG TCGTTTCATT TTTTGATTGA 414 0 

TTTAATTCTC CATTCGTATG TTCTGCCGCT TTAGCTTCAT GACCACTTAA CCCAAAAATC 4 2 00 

AATGTTGTCC CTACTAAAAT TGAAGCAGTA CCTACAGAAT ACTTTCTTAT CGAAAATTTG 426 0 

TTTAATCGAT TTGGTATCAT GCCTTTnCTA TTTGTnGCTG TCTTTTTATA ATTCATTTAA 4 3 20 

TAATACTCCT TTAAAATATC AAAATTTGAT AAATATAA 4358
(2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

35 

TTGACTTCTT AATTCAGCAT TTTCTGCACT TAATGCTTTG TTCTTTTTAA TAAGTTGCTT 60 

TCTTGCATAA ACTTCGGTAT CTATTTTACT ATTACTATAC CTTTGATTTA AAACTAATAT 120 

40 ACCAATTAAT GCTACAATGA TAATGATAAG TACAACATAA AAAGACATTT TTTCACCAAT 180 

CCTTTTTGAC TTCTTTAACT TTGTATACAA TAATAATTAA TAAAGATTAA TTGTTATTCA 24 0 

ATTTCCCACA TTTTTATTAG TTGATTTTAG TTCATCATTG TTATAATCAA ATTATAAACT 3 00 

45 GACAGATATT GATGTTCAAT GAATATGACG TGAAAGATTC GTGAATTCAA GTTTATGTCG 360 

AATTTATGTT ATAACGGTCA TTTAAATGAC AGAATTAGGT CACTCATAGT ATTTTGAAGA 42 0 

TTGAATTCAT TAATTTTAAA ATGTATAATG ATATTTGTGA AAGCGCTTGC TTAGGAGGTG 480

50 

TATTTGAGAG TGAATGAAAT GAATGCTAAA GAACAATTAG TGGACAATTT AATGAAAACA 54 0 

TCATCGCAAT TATTTAAATT TCACGGTGAA GTTGCCATGC AGCTTTTCTT AAATGATGAA 600
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AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 720 

AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATG TGCAATATGA TGATACAACA 780 

5 ATAATGAGTA TTTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCG 84 0 

AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAATT GGG A TGTGGATATA 9 00 

TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 960 

10 TAAcGrGCAT AGTGCATAAn yACTwCAt TT TaTTaaATTG AGaGGgGCAC GATAGGTGCA 1020 

TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 1060 



(2) INFORMATION FOR SEQ ID NO: 456: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 

25 ATAATnACTA AATACnAAAG TTTAACTGTC TTACTAATAA TGACTATGTT ATAATTTTAA 60 

AAGTGATATT TTGGG T AAT C GCTATATTAT ATAGAGGAAA GTCCATGCTC ACACAGTCTG 12 0 

AGATGATTGT AGTGTTCGTG CTTGATGAAA CAATAAATCA AGGCATTAAT TTGACGGCAA 180 

30 TGAAATATCC TAAGTCTTTC GATATGGATA GAGTAATTTG AAAGTGCCAC AGTGACGTAG 240 

CTTTTATAGA AATATAAAAG GTGGAACGCG GTAAACCCCT CGAGTGAGCA ATCCAAATTT 300 

GGTAGGAGCA CTTGTTTAAC GGAATTCAAC GTATAAACGA GACACACTTC GCGAAATGAA 36 0 

35 GTGGTGTAGA CAGATGGTTA TCACCTGAGT ACCAGTGTGA CTAGTGCACG TGATGAGTAC 4 20 

GATGGAACAG AACATGGCTT ATAGAAATAT CACTACTAGT TTAGCTCTCC TAGATGATGG 4 80 

AGAGCTTTTT TCATGAAAAG AACACTTAAA ATTAACG CCy TGTCTTGaTA tAATGACaCT 54 0 

40 

GCcTTGTTTT AAAATAGTAA GCGGATGCgT TAATGTATCA GCGATTAAAT TTGTTGGAAA 60 0 

TGTATAAAAA ACACAAGCTA AGAATAAAAT ACCTGTATAA AAGGAGAATC ATATATGTTT 66 0 

CAATTACTTG CAGTTTGTCC GATGGGATTA GAAGCTGTTG TTGCTAGGGA AATTCAAGAA 72 0 

45 

TTAGGCTATG AAACAAATGT TGAAAATGGT CGTATATTTT TTGAaGGAGA CGCAAGTGCA 78 0 

ATTGTAAAGG CAAATTTATG GTTGCGCACA GCAGACCGAA TCAAAaTTGT TGTTGGACGT 84 0 

50 TTTAACGCAA CAACGTTTGA CGAATTATTC GAACAAACCA AAGCGCTCCC TTGGGAATCT 900 

ATAATTGATA AAGAGGGTAA CTTCCCAGTT CAAGGTAGAA GCGTTAAATC AACACTACAT 96 0 
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TATAACGAAA AAGGTTGGTT AAATGAATCA GGTGCCAAAT ACCCTGTTGA AGTTGCCATT 108 0 

TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACGTGGT 114 0 

5 TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACGT TGGCAGCAAG TTTAATCCGT 1200 

CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 12 6 0 

CA 1262 

(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457:

CCTGGCTGCT TTATCAG CAT CTACTACTTT AAAACCGAAT ACGGATAAGA GTTCTGATAC 6 0 

TGTTGATTTT CCTGAGGCGA TTCCACCTGT TAGACCAATA ACTTTCGGCA TAATTTCACT 12 0 

25 CTTTCTTTAT TTTTGACATA CTGGACAATA ATGACTATTT CTTGTCGCGA TGATTTTTGT 180 

TTCAATTTGA CTTCCACACA CTTTGCATAC CGGCTGCTTA TATACATTAA GATGCAATTG 24 0 

CATCTCACCA GTTTTTCCAT CAGCATGACG ATAATCTGAA ATACTTGTAC CGC CAT ATTT 3 00 

30 AATACCTTCT TCTAGTACTT CTCTAACATA ATAAAAAACC ATTTCTTGTT GTTGGTGTGT 360 

TAAGTCTTTT ACTTTTTTAT CTGGTAAAAC ACCTGCACGA AACAACGCTT CACATGCGTA 420 

AATATTTCCA CAACCTGCGA TTACTTTATG ATCCAAAATC ACTTGTTTGA TTGGTTTATT 480 

35 

CTTATTAGAC TGTTGATGAA TTCGATTTAA ATAATACGTC AATGCTTCAT TTGAAAAAGG 54 0 

TTCAGGCGCT ATTTCTAAAA ATGAAGGATA AGATGCTACA GACGCAACAT TTCTAATTTC 600 

TCCAAAACGA CGTATATCTG AATAAATTAA CTTTTTGTCA TTTGACAACT CAAAAATAAC 66 0 

40 

ATGCCAATGC TTACGATAAT TAGGTATCAT AATATCTTCA AGTTCATCTA CAATGAAAAA 720 

ACCGCCCGCC ATACCTAAAT GACTAATTAA TGTACGTTGT TCTCGTTTAT TATCTAGCTG 78 0 

4S AAAAACGATA TATTTACTTC TTCGTTCTAC ATTTGTAATG GTATAGCCTT CCGATAAAGT 84 0 

TTTAAAAGTA TCTAATTCAA TTCCTTTTAT AATTGTTTCC TTGCCTTGAG CTTTACCTTC 90 0 

GATTACTTTA TCCGAAAATA TAACGTGTTC AATTTTTTGA TTTATAACGT AGGGTTCAAT 96 0 

so TCCTCTTTTT ACATGTTCTA CTTCTGGTAA TTCGGGCATA CCATTAACCT CACTTTATTT 102 0 

TGCATCATAC CAGGTTGCAC CATAACTTGA GTCTACTTTT AATGGAACAT CTArTTGCAA 108 0 
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TT 1142 
(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 
CCTTTAGTAA ACAATCCTTC TTTAGTTTTA GTACGTTGTT CCAATCCAAA TAATTTGTAT 60 

15 

TTCATTGCCT CGCCCGATTG AGTGCCGCTA AAGTTATCAT CTTTCATGTT AGGCGTGTTG 12 0 

GTAAACATGT GTATATCACT GTTTAAACGG TCTTTATAAG CTTCGGTACC TTGTACATCG 18 0 

TATTGCTTAT AAATATAACC ACCATCAACA GAGCCTTCTG TTTCTCTACC TTCGCTATCA 24 0 

20 

GCATAAACAG TCGGTTCTAA AAACAACACG TTAGCTTCCT TTTGTTTTCT AACTTCTACA 30 0 

GGATCTAAAT TTAAATT AC C TTTAATAAGT AACATAGCGT CATTTAAATC ACTCATATAG 3 60 

25 TTAGCAsymy CTGATTCAGC ATTATCATAC AAATCAATTA AAGTGATTAC TTTCTCATAA 4 20 

TCCCCTTTTC TTCTTTCGTT GTTGCTAAAT TCTGTAATAG GCATACGTTC GAAAGAGTGT 480 
GATTCAAAAC CGTTTTCACG TGGTGTGAGC TTCAATCCAT TTGTTCTACT GGTAAGATAT 54 0 

30 CTATAAACAC CGTGTGAAGT GAATAAATCA ACTGTAAACA CTTCATCTTC GTCAGTCTTG 6 00 

TCTATTGGTT TAGTTCTTAA ATATCTAACG CCTGCGATAC TATTACGTTC AATTGTATTG 66 0 

TCGTATATGA CAAAAGTACT CATTGCATCA CTCTTGTATA AACGCGTTTC ATCATCTTGG 720 

35 

TTTCTAATCA TTAACTCATA AGCTTTGCCA TAAATTGACA AATCTAATCC TAAAGATCTA 780 
TTGTGTGACT CAACATCATT TAAATCATTG AACGCCTCAA TAACTTCTAA TACATCTTTG 84 0 

TCATCATCTT GATATTGAAT TGGATTACCC AAGAAATAGC C GTTG AT AAA ATCGCTAATA 9 00 

40 

TAAGATGCGT AATCATGCGC TACACGGTTA TCTGCCATGT ACTCTTCTTT GCGTCGTGTT 960 
AACTCAACTA AGTTCTTAGT TTTACCTTCG TAATAATCAC TTAACACTTT CAATCTAGGT 102 0 

45 CGTTGGTAAT CCATGTGATG TTCAATGTAT TTACTTACTT CATTAACGTT TTGTAATAAA 108 0 

TCGGATTCCG TCCCGTCATA TGTGTAAACA ACATTGGCTT CATCATTAAA TAAGTAATTT 114 0 

ATGTTTCCCC GTAGATCTGT ATCTGTTTCA AATTCGTTTA CTTTTAACAT TTGTTCCCTC 120 0 

50 CTATAATCCT AGAGATTTTA TTGTGTCAAC TTTCGAACTG AGATTTGTGC GTTTt CTAAC 126 0 

CGGTCTGTAG AATCGTTCCA CTGAATAACG CAACGAATCG ATACAATGAT TGTATGTATC 1320 
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 14 4 0 

TAACCCTTGT ACAACCGAGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 1500 

5 ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 156 0 

ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 1620 

CTCTTCAATG ATGTATAACT TCTTTTTCTT TACATCTATT TTAGAATGTA TAAAAGCACT 1680 

10 AGGATCATTA ACGTAGCCAA AGTCCAATCC AAAATAAGAA GGTAAATGTC TTAACTCATC 174 0 

TTTATTTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 180 0 

ACCTAACGCA TAAA 1814 



15 

(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

AATTnAGATT ATTACCCTCC TTTAAAAATA TTTGTTTCAC AATTTTTTAT TACCTATTTA 6 0 

CTGGTTTCAT GTCTTATGGG GCATTTTACT TGTTGGTATT TGAAAATGTG CAAAATTTAA 120 

30 TCTTATATGT TTCTTGGCTT TTCATGACTA TGCTATTTAT GTTTATGAAT ATGCATTCAA 180 

TTATAGATAA AAAAGTACAT ATATTCTTAA AGTCTAATAA ATAGTTACAA ATTTAGTTAG 24 0 

TTTTCAATTG TTAATTAGGG GTGGTAAACA GTGCTTTGTG AATCTAGACA AATTTATAAA 3 00 

35 AATCCTAAAT ATCGAGTTAT TAGATATAAT AATGAATATT TCATGGTCGA TTTAGTAAGT 360 

ACTTGGATTA CTTATTTTTT CCCTATGATT AATTGGTTTT TGCCCAAAAA ATACGCAAAA 420 

ATTAGCGAAA ATGAATTTGA AAGGTTAAAT ATAGTCGAGC CTGTTAAAAA TAATGTTTTT 480 

40 

TGGCCGGTTG CAGGAAGTTC AGTTCTATTT GGAATTATAT TGAGAAAGTA CGGTAACTTC 54 0 

TTTAATGTTC AGTTTGAAAA ACAACTAGCA ATCACTGTAT TTTTTATCAT GTTAATAGGG 600 

ATGTTAATTT TTTATTTTTA TCTAAATAAA AAATTAACAT TAAAAATTTT TAATACCAAC 66 0 

45 

GTGGGTAATA AGAATAGGAG TTGTAT 686

(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1300 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

5 ATCTGCAATT ATGGGCACAC CCAAGCTnAT GCAAGTAACT AAAGGAGAAG TACTTTTAGA 6 0 

CGGTGTAAAT ATTTTAGAaT TAGAAGTTGA TGAAAGAGCA AAAG CAGG AT TATTCTTGGC 120 

AATGCAATAT CCATCAGAAA TTACAGGTGT TACAAATGCT GATTTCATGC GTTCAGCAAT 180 

10 

CAATGCGAAA CGTGAAGAAG GACAAGAAAT CAACTTAATG CAATTTATTA AGAAATTAGA 24 0 

TAAAAACATG GATTTTCTAG A CATAG AT AA AGACATGGCA CAACGTTATT TAAATGAAGG 3 00 

TTTCTCAGGT GGAGAGAAGA AACGTAACGA AATCTTACAA TTAATGATGT TAGAAC CTAA 36 0 

15 

GTTTGCaATC TTAGATGAAA T CG ATT CAGG GTTAGACATC GATGCATTAA AAGTTGTATC 4 20 

TAAAGGTATT AACCAAATGC GTGGGGAAAA CTTTGGTGCA TT AA TG ATT A CACACTATCA 4 80 

20 ACGATTATTA AATTACATTA CTCCTGATAA AGTACATGTA ATGTATGCTG GTAAAGTCGT 54 0 

TAAATCTGGT GGTCCAGAAT TAGCAAAACG TCTTGAAGAA GAAGGATATG AATGGGTTAA 600 

AGAAGAGTTC GGTTCAGCTG AATAATCTTA TTAATACAGT ATCCATGAGA TGTTCATCTA 660 

25 TATATGATGA AAATGAACAT TTATACGAAA TAGTAAATTT CATCAAGTAG GAGGAAAAAG 72 0 

TTATGACAAC TGATATTTTG rACaTTyCTG AAGAACAACT TG T TG ATT AT TCTAAAGCCC 78 0 

ACAATGAACC TTCTTGGATG ACAGAATTAC GTAAAAAAGC TTTGAAATTA ACAGAAACTT 84 0 

10 

TAGAAATGCC AAAACCTGAT AAAACAAAAT TAAGAAAATG GGATTTTGAT TCTTTTAAAC 90 0 

AACACGATGT AAAAGGTGAT GTTTATCAAT CTTTATCACA ATTACCTGAG TCAGTAAGAG 96 0 

AAATTATTGA CGTAGATCAT TCTAAAAACT TAGTAATTCA ACATAATAAT ACGATTGCGT 102 0 

35 

ACACACAAGT TGATGATAAT GCATCGAAAG ATGGCGTTAT CGTTGAAGGT TTAGCAGACG 10 80 

CTCTTATGAA CCATAGTGAT TTAGTACAAA AGTACTTTAT GAAAGATGCA GTAACAGTAG 114 0 

ATGAACATCG TATCACAGCG CTACACACGG CATTAGTTAA TGGTGGCGTA TTTGTTTATG 120 0 

40 

TTCCTAAAAA TGTAGTTGTA GAACATCCAG TACAATACGT TGTGTTGCAC GACGACGAAA 12 60 

ATGCAAGCTT TT AT AAC CAT GTTATCATCG TTACTGAAGA 13 00 
(2) INFORMATION FOR SEQ ID NO: 461:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3135 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GACAG CAAAT TCAAGATACA TTAAATAAAG ATATTGTCAT AAAGCATATT CTTGTTCGAG 60 

ATAAATCTAA AAAGAGACCG CTAAATATTA GCCAATATCA TTTAACTGAA GATGTTAATG 120 

5 AAATTTTAAA TGATGATTCA TTAGATATTA TCGTTGAAGT CATGGGAGGA ATTGAACCAA 180 

CTGTAGATTG GTTAAGAACA GCACTTAAAA ATAAAAAACA TGTTATTACC G CAAAT AAA G 24 0 

ATTTATTAGC AGTACATCTT AAACTTTTAG AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300 

TAAAGTTTGA AGCGAGTGTA GCAGGTGGTA TTCCGATCGT AAATGCCATA AATAATGGTT 36 0 

TGAATGCGAA TAATATTTCA AAATTTATGG GAATTTTAAA TGGTACCTCT AATTTTATTT 420 

TATCTAAAAT GACTAAAGAG CAAACGACAT TTGAGGAAGC ACTTGATGAA GCGAAAAGAC 4 80 

15 

TTGGTTTTGC TGAAGCGGAT CCAACTGATG ATGTAGAAGG GGTAGATGCA GCGCGTAAAG 54 0 

TTGTCATTAC ATCATATTTA TCATTTAACC AAGTCATTAA ATTAAACGAC GTTAAACGAA 6 00 

GAGGAATTAG TGGCGTAACT TTAACTGATA TTAATGTAGC CGATCAACTG GGGTATAAAA 6 60 

TTAAATTGAT TGGTAAGGGA ATATATGAAA ATGGCAAAGT TAATGCATCG GTAGAACCAA 720 

CGTTAATTGA TAAAAAGCAT CAATTAGCAG CTGTAGAGGA TGAATATAAC GCGATTTATG 78 0 

25 TTATTGGTGA TGCCGTTGGT GACACGATGT TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 84 0 

CAGGTAGTGC CGTTGTCAGT GATTTATTGA ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900 

ACACATTGCC AC CAC ATTTT GAATTAAAGA CAGATAAAAC ACGGGAAATG ATGGATTCAG 96 0 

30 ATGCAGAAAT TAATATTAAA GAAAAATCCA ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020 

GTTCAATTGA AAATTTTGAA AATGAGTTAA AGGCAATATT ACCATTTCAC CGATCATTAA 1080 

GAGTTGCAAA TTACGATAAT CAATCATATG CCGCTGTTAT AGTTGGATTG GAATCATCAC 114 0 

35 

CGGAAGAATT AATCACTAAG CATGGATACG AATTGACAAA GTATACCCAG TAGAAGGAGT 1200 

TTAATTATAA TGAGAAGATG GCAAGGATTA GTAGAAGAGT TTAAAGCACA TTTACCAGTA 1260 

AATGAAAATA CACCAAAATT AACATTGAAC GAGGGAAATA CACCACTCAT TCATTGTGAA 1320 

40 

AATATGTCTA AAATACTAGG CATAGATTTA TATGTGaAGT ATGAAGGTGC CAATCCGACA 13 80 

GTTCATTTAA AGATCGCGGT ATGGTAATGG CTGTGACAAA AGCAAAAGAG CAAGGTAAGA 1440 

45 AAATTGTAAT ATGCGCTTCG ACTGGAAATA CATCAGCGTC TGCAGCAGCA TATGCAGCGA 15 00 

GAGCAGGTTT AAAAGCTATC GTCGTAATAC CAGAAGGTAA AATTGCATTA GGTAAATTGT 156 0 

CGCAAGCAGT AATGTATGGT GCAGAAATCG TTTCTATTGA AGGAAACTTT GATGAAGCTT 1620 

50 TAGAAATTGT AAAAGAAATT GCAAAAAGTG GCGAAATCGA GCTTGTAAAC TCTGTCAATC 16 80 

CATTTAGAAT CGAAGGACAA AAGACAGGCT CATTTGAAAT TGTACAACAA TTAGACGGTG 174 0 
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AAGGCTTTAA AGAATATCAT GAAGCTAAAG GATCACAATT GCCGAAAATG TTTGGCTTCC I8 60 

AAGCTGAAGG CGCATCACCA ATTGTTCAAA ATAAAGTCAT TAAAAATCCT GAAACGATTG 1920 

5 CAACTGCTAT TCGAATTGGT AATCCTGCTA GTTGGGATAA GGCGACTAAT GCTCTTAAAG 1980 

AATCAAATGG ATTAATAGAT AGTGTTACTG ATGATGAAAT TCTAGAAGCA TATCAGTTAA 2 040 

TGACAACTAA AGAAGGTGTC TTTAGTGAAC CAGCGAGTAA TGCTTCTATT G CAGGTTT AA 2100 

TTAAATTGCA TAGACAAGGT AAATTACCTC AAGGTAAAAA AGTAGTTGCT ATTTTAACTG 2160 

GTAATGGATT AAAAGATCCT GATACTGCTA TTTCACTACT AGATAATCCG ATAAAGCCAT 2220 

TGCCAAATGA TAAAGATAGC ATTATCGATT ATATTAAAGG AG CTTT ATAA CATGTCGAAT 22 8 0 

15 

GTTTTGGAGT TAACAATTCC TGCATCAACA GCCAACCTTG GAGTTGGCTT TGATTCTATA 234 0 

GGTATGGCTT TAGATAAATT TTTGCATCTG TCTGTAAAGG AAACATCAGG GACAAAATGG 24 00 

20 GAATATATTT TCCATGATGA TGCATCTAAG CAATTGCCTA CTGACGAAAC AAA CTTT ATT 24 6 0 

TATCATGTAG CACAACAAGT TGCTTCTAAA TATAGTGTTG ACTTGCCTAA TTTATGTATC 2520 

GAAATGAGAA GTGATATTCC ATTGGCAAGA GGGTTAGGTT CGTCAGCTTC TGCTTTAGTA 2 5 80 

2S GGAGCTATAT ATATCGCAAA TTATTTTGGT GATATCCAAC TGTCTAAACA TGAGGTATTA 264 0 

CAATTAGCGA CTGAAATCGA AGGACATCCT GATAATGTTG CGCCGACCAT TTATGGTGGT 27 00 

TTAATCGCTG GATATTATAA TGATGTCTCG AAAGAAACGT CaGTtGCACA TATCGACATA 276 0 

30 

CCAGACGTGG ATGTGATTGT AACGATACCA ACTTATGAAC TAAAAACAGA AGCATCAAGA 2820

CGTGCTTTAC CACAAAAATT AACACATAGT GAAGCGGTTA AAAGTAGTGC AATTAGTAAT 2 8 80 

ACAATGATTT tGgCATTAGC ACAGCACAAT TATGAATTAG CAGGTAAACT CATGCAACAA 294 0 

35 

GATGGCTTTC ATGAACCGTA TCGTCAGCAT TTAATTGCTG AATTTGATGA AGTGAAAACA 3000 

ATTGCTAGTC AACATAATGC CTATGCAACT GTAATTAGTG GTGCTGGACC AACTATTTTA 3 06 0 

ATATTTAGTC GTAAAGAAAA TAGTGGGGAA TTGGTTCGCT CTTTAAATAG TCAGGTAGTA 312 0 

40 

TCATGCCATT CTGAA 313 5 
(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462:
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AGnTCAATAT TTAGATCAAG CCGTTTTAAG TAATTACGAA CAAGTTTATA TCATTCATGG 120 

TAAAGGTACA GGTGCACTTC AAAAAGGTGT ACAACAACAT TTGAAAAAGC ATAAAAGTGT 180 

5 TAGTGACTTT AGAGGTGGTA TGCCAAGCGA AGGTGGATTT GGCGTTACCG TTGCAACACT 24 0 

AAAATAAATT ATAATTTGAT AAATTAAATA GCTGCAGTTA AAATAATGTA AAG CAACAAG 3 00 

AATACATTTC AAACATGTTA TTTGAAATAA GCATAAAAAT TGAGCAAATA GAAATACATG 3 60 

10 

AAGCATGTTA TCTGATATAA TTTGAACATC ATAATAATAA TTAAGGAGGA TTGGCATTTA 4 20 

TGGCAATCGT AAAAGTAACA GATGCAGATT TTGATTCAAA AGTAGAATCT GGTGTACAAC 4 80 

TAGTAGATTT TTGGGCAACA TGGTGTGGTC CATGTAAAAT GATCGCTCCG GT ATT AG AAG 54 0 

15 

AATTAGCAGC TGACTATGAA GGTAAAGCTG ACATTTTAAA ATTAGATGTT GATGAAAATC 600 

CATCAACTGC AGCTAAATAT GAAGTGATGA GTATTCCAAC ATTAATCGTC TTTAAAGACG 6 60 

GTCAACCAGT TGATAAAGTT GTTGGTTTCC AACCAAAAGA AAACTTAGCT GAAGTTTTAG 72 0 

20 

ATAAACATTT ATAAGTTACA ACCAATGACG ACTGGGGCAT TTCTTTAATG AATTGCTCCA 780 

GTTTTTGTTT GTGTTTTTAA TATAAAAAGT TGAATGATAA GTCATCATAT TGTTTACGAC 84 0 

25 TTGAGAATGG TGGGATTAAT AAATCTATGA ACGTTAAATG ATAATCTAGC ATGCTGATAG 90 0 

ATTTGTAGCA GTTGGTTTGA TAAAACCATG TTCAATATTA CATGATGTGC ATGAAAAGTC 960 

ATACT CGAAG ATGTTGATTA TTAACTAGAA TT AG TGGTG A TAAATTTGAA GCACTTTTGT 1020 

30 AGCATCATTC ATTTTAAAAT TAGAAGGGGG GATATTTTTG GAAGACTATA AGCAACGAAT 108 0 

TAAAAATAAA TTAAATGTCG TACCTATGGA ACCAGGATGC TATTTAATGA AAGATCGTAA 114 0 

TGATCAAGTG ATATATGTTG GCAAAGCTAA AAAGCTAAGA AATCGATTGC GATCATATTT 1200 

35 

TCACGGGTG 1209

(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2410 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 
AGTTCAACAC GACGAATTTT ACCTGAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 6 0 

50 TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 12 0 

AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 180 

GCATCTTCAA CTTCAAAAGG CCCAATCGTA TAGCCTGAAC TAATAATAAT GTCATCTCGA 300

CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTGATGTAG 360

TATTtlACCTG TTTGCGCTTT CGCCGTACGT tCTGGCTCTT tATAATACCC TTTGAAAAGT 420

GCTGGCAAAT CAAGTGGTAC TGCAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 480

TCATCATCTA CTACAGTGAC CGAACTACCC GGAATGCCTT TACCCATTGA TCCAATCCTC 540

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTGGCC ATATCCATCT 600

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTGGCTCA 660

CCTGCAGAAA CGGCACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720

GCCATCATAC GATATTCTGT CGGTGTACAA CATAAAACAT TAATTTGATA TTTTTGAAGC 780

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTGAA 840

CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGc TGCTGTTGCC 900

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 960

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTGC CAGTTGTACC AGATGTATAT 1020

GACAGAATCG CCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTGCGTTT 1080

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CGGCAATAAC AAATTTAGTT 1140

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 1200

GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 1260

CATGGAATGA TTGCAACACC TAATTTTAAA GCAGCAATAT ATAATTCATA CGTCGCAATA 1320

GATCGTGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TGCTAAAACA 1380

TTACCTACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 1440

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 1500

GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACTTCCTT 1560

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620

TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 1680

ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGGG 1740

CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACGAT TAATTATTTT 1800

TACATCTGCA CCTTTCGAAG TGCCACGATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 1860

TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 1920

AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATGCTT 1980
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ATTTAACACA CATAATGAAT CATTTGATAA AT CATTTATC GCTTCTAGCA CATCGTAATA 2100 

TGCAGTGTCA TTATTTTTTT GTATTGTGAG ACAATCATCC AGTTCTTCTA TTACATAACT 216 0 

TCTATATTCA TCATAAATTT T CAT AAT AAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 222 0 

ATTACTkTAT ATCTATTACT AAtkCATTCC CGTATTTATT AATTACAATC ATAGTTTGGC 2280 

TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GT ATT AAT AT TTCATGCAAA TGGGGGACAG 2 34 0 

GAGTCGCCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 24 00 

AATAGCCGAC 2410 

(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

TTTATTAATT GTAAAAAATT GAGTAAATTA TCTTTACATT CTAAATTAGT CTTAACTACA 60

ACTAGTATCC TAATAATTAT AGGAGCTATT ACATTCTTTT TATTAGAACA GTTTAATACT 12 0 

ATGCAACATA TGGGACTAGT TGAAAAAATC GGAAATTCTT TTTTCCAATC AGTAACAACA 180 


CGAACAGCGG GTTTTAACAG TATAGATATA GCAAGCATTA rCAAATCTAC CGCATtAaTG 24 0 

TTAATGCTAC TTATGTTTAT TGGTGGTGCC CCTCTCAGTG CAGCTGGAGG AATTAAAATA 300 

AC TACTTTTG CAGTTGCGTT TATTTTTGTA CTAAATTATA kACGTAAAGA AAATAATGTT 360 


TCAGTATTCA ATAAAGAAAT ATCTGACAAA CATATAAAAC TATCTATTGT TACCATTAAT 420 

ATCTCATTTC TATTTATCAG CATCATTACT TTTATATTAT CGATAATTAA TCCGAACATA 480 

TCATTAATCA AG TT ATT ATT CGAAGTGGTT TCTGCATTCG GAACAGTAGG GTTAAGTATG 54 0 


AACCTTACCA CAGAATATCA TGGTATTACT AAAATAATTA TTATATTCGT 590 
(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 
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TATTCGATTT G An T CTTTT A AAATTATGTT TAAAAACATG TCTAATGATT CCGCATCATT 12 0 

TTTTGCACCA TCAATAAGCG TTTCAGcAAA CCCCTTAATT GAAGTAATAG GTGTTTTTAA 180 

5 TTCATGTGAA ACATTTGCTA CAAATTCACG TCTTAGATTT TCAAGTTGTT TCAGATTTGT 24 0 

TATATCATGC ATCACaACTA AAATCCCtTG CAAACTTTTT TGAGACCTAG TTAAAATCGG 3 00 

AACGCATGAA ATATCAAAGT ACTTGGCATG GACTTGGTTT ATTGCAACTT CCAATTGTTC 3 60 


ATAAATAGGT TTTTCAACTT TAAAACTTTC TAAAATTAAT TGCTCAATTT CAGTATTAAC 4 20 

ATAGCCGTGA TAGCCTACTT GTTCAATATT ATGCGAGATG TTGAACTGTT CATAATACGC 4 80 

TTTATTTGCA ACAACGATTT TTCCATTTCG ATCTATCATT AAAATAGCAC TTGGAATATT 54 0 


TTCAATCGTT GTTTTTAAAC GGTTGGATTG AATTTTTTGC TCATTATTAA GCTTTTGAAG 6 00 

GCGTCGTGCT AAATCATTGG TAGACACAAA AAGCGCTTTA GTTTCTACAA CATTACTTTC 6 60 

AGGTACACGT ATGTGATAAT AACCATTTGC CAACAATTGT GTTGCATAAG TAACTTCTTG 720

AATGGGACGG ATTAATGTAC GCTTAAAACT ACGGCTTGCA AAATACAGAC AAATGAGTAC 780 

AACTAAACAT GTCAAAATAA GATATTTCCA CAACGTCCAA TGCATTTCTG TAATAT CGTT 84 0 

25 ATTGTAACCT TTAATCCATA CATGATAACC GTTAACCTTC TTATtAAAAA TAAAAACGTC 90 0 

CCTTT 90 5 
(2) INFORMATION FOR SEQ ID NO: 466:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 
TTTGGTTAGC CCATAAAAGA AAAAAACAAG TAGTCATTT T TAAACAACAT ATCAAGTCTA 6 0 


CCCAAGAAAT ACGTTTTGAC AAAGCGAAAG TGCTTGAACA CAAAGATGAA ATAGCAAATT 120 
TTATTTCTTT CGAACCACAA AGTTTTGAAT TTTATTATTT TACAGAATCG GAATTTTCAG 180 
AAGAACAATT AAATGAAGTT TCGCCAATTA GAATTAAATT CAATGTTATA AGACACACAA 24 0 


AAGATTTGaT AAAGCATATG CCGAATATAT TTTTGGcTAG ACTTATTTCA GAAGATAATG 300 
ATAAAAAGAC ATATATGTTC TATAAACGCA AAGTATTAAC CGATAACTTT TTAGATAAAT 36 0 

50 ATATGCAGAA ATTTTCACCG GCAACATACA CAATAATATT TGTAAATGTC TTAATATGGT 4 20 

TATGTATGAT TTTATATTTA AATAATTTTT CGGATGTAAA ATTATTAGAT GTTGG CGGGT 4 80 

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ATTTTAGTTT TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAG 6 00 

TCGAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA GGGTTGTTTG 660 

GAAACTTTGT ATCACTATCA TTTAATACGA CTACAATTTC AGTTGGGGCT AGTGGTGCTA 72 0 

TATTTGGTCT GATTGGATCA ATTTTTGCGA TGATGTATGT TTCAAAAACA TTTAACAAAA 780 

AAATGTTAGG ACAGTTATTA ATTGCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 84 0 

CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 90 0 

TTGGCTATTA CTATAAAGTG AATCGtAATA TTTTTTGGaT TTaCtAATTG GTATGCtTGt 96 0 

tATATTTAwT GCACyTCmAA TTAgAtTTTT ACmATTAAAG AAGATAATAw TTATAA 1016 
(2) INFORMATION FOR SEQ ID NO: 467:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:

AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 60 

ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTTCTTT 120 

CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 180 

TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TTAGCATGGC 24 0 

TAAATTGTTG AAGACCCTTA CACGATAAAC GTATAGCGTC TGAATTCTCA TTTAACAATG 3 00 

CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA ACtTCGTTAC AATCaTCGTG 3 60 

kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 4 06 
(2) INFORMATION FOR SEQ ID NO: 468:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
50 AGATAATACA CTTGAAGTTG GAATGGTTTG TGACGGTTAT TTAATGCGAA TTGAAAACTT 6 0 

AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATACG ATTACTAAAA TTAAATTAAA 12 0 



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AGGTACAGCG TTAAAACTAA GAGAAG CCAT CAATTATGAT GAAATGGTTA TTGTAGATAG 24 0 

TATGACGTAG TTCCTAATTA TGCkAAAAGG GATTGATGAA AAACTGAAGG GCTTTTCATC 3 00 

5 AATCCCTTTT ATTTTAGGGG AATTGAATAG ATAGTTTTAA ACTATACGAA TTATTAATAT 360 

TTGAGATTTA ATTGAAATAA GTTTTAAAAA TTG GAG GAGA TAGATTAAGC GAAGTCATTT 420 

AAAGGTGAAG TTAAGTGTAT TCACAAAAAn TAGCCACACT CATATGACAT CGGATGAGTG 4 80 


TGGCTTAAGG ATCTATGGGG GGAGGAAnCC ATAGATGTTT ACTTTGATAG GCCAGATTAA 54 0 

ATATCAAAGT ATGCGATTAT TTATAGCTTG ATGCAAAAGT GGTATGCCTA TTAAAAGTTA 600 

CTGCACATAG CTTTTAATAT TCCGTTCAAA GGAAAGGGGC ATACAATTGA ACAATCTGTA 660 


ATAGTACTTT TAACCAGCTA TGCTAAAAGT CTAGTAGGGA GAACAGTTGT CCAATCACAT 720 

AAGAACCTCT AACTTCGTTA GTACGATTAA GAAAAGCTTT TTAGTTAGTA TGTAATACAA 780 

TTTATTGACG CGCGTGAATC TCTTTTATAA GAGTGTGTAG GGAATGGCGT TGTATAAATT 840

GTATTAGAAG AACTTCTAAC GCATCTCTGT GG TTAAAAG A GATGAAGGGA ACGACAGTTT 900 

aATTAAAACT GCATAAGAAC TTCTAGCTTT TCTCTCTCGT T CAAAG AG AA GCAGcTGTTC 96 0 

25 GCAGTTTAAT CAAAACCACA TAAAG CTTTT AACTTTACTC TTTGATTTAA AGAGTGATAA 1020 

ATGTTTACAG TTTAATTAAA ACTGCATAAG AACTTCTAGC TTTTCTCTTT CGTTCAAgAG 10 80 

AAGCAGCTGT TCGCAgTTTA ATCAAAACCA CATAAAG CTT TTAACTTTAC TCTTTGATTT 114 0 

30 AAAGAGTGAC AAATGTTTAC AGTTTAATTA AAACTGCATA AGAACTTCTA GCTTTTCTCT 12 00 

TTCGTTCAAA GAGAAGTTCT AATACCACCA TATCGTGCGA TCGGGAACGG TATATATATT 12 60 

AATAGGAGGG TAATATATAT TTAACGCACG ATATGGGACT ATTAGCCTTC GACTTTGTTA 1320 


TGTTGATGTG TGGCCTAAAA TATTGGAGAT AC CAATATTT T AGGTTG CAT CAACATCA 137 8 

(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4171 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 

TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 60 

TCGTTTAATC ATACATCCCC CCTAATCTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 120

AATAGAGGTT GGATACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 180 

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TTATATTCTA TTCAATTTAA TCTATGGATA CTGTGTCCCC ACACGACAGC AAAAGTTATC 300

ATACTTCTTT ACATCACTAA GTCAATATAA ATGATTTAAT CAGTATTTAC ACTTTATTTG 360

CTTAATACTG TCTAATTTTT TTGTAACGTT CTTTCCAAAC TTTGATAAAA TCTGGCGCGA 420

ATGGGCCCTT CTTCTGTTCT ATCCATTGTT GAAGAATGTC CACGTTGCGT CTTAAAATAA 480

TATCAATATC ATGCGGATAA TTCATTTGAT TCATATGTTG CTCATATTCA TCTTCATCTA 540

ATAAATGATA CTTTCCGTTT GGATATACTT TAATATCTAA ATCATAGTCT ATATATTTTA 600

ATGCCTCTTC ATCACAAACA AATGGTGATG ACAAATTGCA ATAGTAATAA ATTCCATCTT 660

CTCTAAACAT GCAGATAACA TTaAACCAAT ATTCTGAGTG AAAGTaAACA ATTGCCGGTT 720

CACGTGTTAT CCAAGTTCTT CCGTCACTTT CAGTCACTAA CGTATGATCA TTTCCACCAA 780

TGACAACATG ATCAGTACCC TTTAATATTG TTGTTTCAGA CCAAACGCGA TGAATCTTAC 840

CATCATGTTT ATAACTCTGA ATTTTAATGT TTTCCCCTTC TTTAGGTATG GATTCTCTGA 900

CCATACTCCA CACCACCTTC TGTTAATTTA ACCATTATAA ATTATAGCAT ATTTCAGAAA 960

TAGTATTATA TAAATACATA TTTTTACGAA ATAAGATTTT ACTACTTAAT AATTAAACTC 1020

GGTAATATTG CTAAGTACTA CAACAGAGAT TTACATGTCC CATTTAAAGT ATATAAAATC 1080

ATCACTTTTA TATATCAACA CTTTAACTTT TTGACATTGT TATTCTATGA GATTTAAAGA 1140

TATCATTTAT ACTTTTTAAA ATTAATGTCA CTATGTTTTC CGATAATATT ACCAATCATC 1200

GAATGTTACC CATTTATAAA TTGATAAATs TTTGACATAG GTACAGGGAA TGTATATTGA 1260

TCTCGATCAC TTAAATCAAA CCAAATCATG TCATCTGGTA ATGTTTCAAT GTTAATTGCT 1320

CCTGAAACGG CGTATACTTT AATCTTCCAT GTTAAATGAG TAAATTGATG CTTCAACTCA 1380

AAAATAGGTG TTTCTACTGG TTGAATGTCA TGACCGATTT TTTCAGTCAT TTTACGTCTA 1440

GCATGCTCAC TTTCAAACAT AGGAAATTGC CACATACCAT GCAATAATTT TTCGCTACGC 1500

TTTTGCAACA GATATTGACC TTGATTATTT CTAATTAAAA AGACGGATTG CTCAATTACT 1560

TTTTTACTTA CATTTTTAGA TTTAACAGGT AACTTTTCAA ATGTACCTTT ATCAAATGCC 1620

TCACAGTTTT CTTGAACTGG ACAAAATAAG CATAATGGAT TTTTTGGTGT ACAAATTAAC 1680

GCCCCTAATT CCATCATAGC TTGATTAAAC GTTCCAGCTT CTGTAGTAAC ATACGGTAAT 1740

AATTCTTGTT CGTACGATTT CCTCGTCGAT TGTAATTTAA TATCTCGATA GTCATCATTC 1800

AATCTAGACC ATACTCGAAA AACATTTCCG TCTACAGTTG CTAGTGGTAC ATTATATGCA 1860

ATGCTCATTA CTGCAGCTTG TGTGTATGGG CCAACACCTT TTAACGCTTT AAATTGATCA 1920

GGATCTTTGG GAACTAAGCC TTCATATTTA TCATGAACTT CTTTAATCGC CGTATGAAAA 1980
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GCTTGaCTCA AAACTTCCAC AGTTGGAAAT CGTTCAACAA AACGATGATA ATAGTCAATA 2100 

ACTGTTTTAA CTTGTGTCTG TTGTAACATG ACCTCACTTA ACCAAATATA GTACGGATTG 216 0 

5 GTCGTTTGTC GCCATGGCAT TTCTCTTTGA TTTTCATCAA ACCAGTGTAT CAAATTTTCT 22 2 0 

TTAAAACTAG ACTGCTGATA CATTTATAAA ACCCTTTCCT CACCAAAATT AATTGTCTTT 2 2 80 

ACTCATAATG TTTTTATTGT ACATTAAAAT CATGGTTAGT ATGTAAGTTA ATTTAGTTAT 234 0 


TTGCGAAATT GGATTATAAT AGTATATATA ATATTATGAA ATGAGTGAAC TGATATGGAC 24 00 

ACTGCAACAC ATATCGCAAT TGGGGTGGGC CTTACAGCAC TTGCAACTCA AGATCCAGCA 24 60 

ATGGCTTCTA CGTTTGGTGC AACAGCTACA ACCCTTATCG TTGGTTCATT AATTCCTGAT 2 520 


GGGGATACTG TTCTTAAATT AAAGGACAAT GCAACATATA TTTCGCATCA TAGAGGTATC 2580 

ACGCATTCCA TCCCTTTCAC AATACTATGG CCAATTTTAA TTACATTTTT AATATTCACG 264 0 

20 TTCTTTAGTG GAACCAACCC ATTTCATGTA TGGATGTGGG CTCAGCTCGC AGTATTTTTA 27 00 

CATGTCTTTG TAGATATATT CAATTCTTAT GGTACACAAG CGCTTAGACC TATCACAAAC 27 60 

AAATGGATTC AATTAAGTGT GATTAACACA TTTGACCCTA TTATTTTCAC AGTTCTTTGT 2820

25 ATTGGTATTG TATTATGGGT TATAGGCTTG CATCCATTTG CAGTCTTCTT TCCTATAATC 28 80 

GCTTTACTAA TCATTTATTA CATGATTCGT TTTAAAAT G A GAGCCGTAAT TAAGCAACAA 2 94 0 

GCTTTAAAAG CAATTCAACA AGAGCATCAC CCTGTTAAAG TATTTGTTGC G C CAACAAT A 3000 

30 AAATTTATGG AATGGCGTGT CGCGATACAA ACTGATGCAC ATGACTATGT TGGAAAAGCA 3060 

TATGGTAGGA ATGTGGTGTT TAGTGATAAA GTGGAACGTC AAACATTATC AACAGACTCC 3120 

ATTTTATGGA AAGT CAAAGG TAATAAAGAT ATACGTACTT TTTTAAACTT TTCATCAATC 3180 


TATCGTTGGC AAACAACAAC GTTAGCAGAT GGTTCTACTG AAATTCGTTT GATTGATTTG 324 0 

CGTTATTTAA AAAATGATCA TTATTCATTT GTGGCAATTG CACATGTAAC AAACGATAAT 3 300 

GTCATAGACC ACTCTTATAT TGGCTGGGTA TTTACAGAAG ATAAGTTACA ACGTAAACTG 336 0 


TATGCTAAAT AATTTCAAGT TATTATTCAC TAAAGTTAAT CTATAAAAAA TGAACAACCG 3420 

GGCAGAATGA AAATCAAAAC GATTTTTACT CTGTCCGGTT TTTTAATGTA AAACTATGAA 34 80 

45 TGCTTTTACA AAATCTAAAA TTTATATTGT TGCTAACAAA CTACCTTTAA TGACTCGAAA 354 0 

TATCAAAATC AGTATAGGAA AACAATATCT AGATGATATT CTAATTGTTT CTGATTCTCA 36 00 

CAGATTAATT TACACAACAG GTCAGCTAAA CATCATGAAG AAGTATCCGC CTCGTCTGTA 366 0 

50 CTATCATTTG AAACATCCTG TTGATTATCa GTTTGTGTCA CTTTAGATGT TTTATTaTAA 3720 

ATTGCGTGTG TCGTATACCT TGCTAATACG AAATTTACTG CAGCTATTAA ACATAAGAAT 37 8 0 
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w 






TGTGCTATAC CATTAACAAT GTAATACATT GGATTTAGCA TTAGGATGTG ATTGATAAAT 3900 

ACATGATTTG GATTTGGTAT GAAAATAATT GGTAACAATA AGAAACACAA TACACAAACC 3960 

CCATAAAATA TGATATTTAT TTTTTCAGwT AACAGTCGAA TAAGACCAAA AGTAACGGAT 4 020 

ATTAATCCTA CAAAAATAGT TGCCATCACA ATAAAATAGA ATAGCGCTAT ATATGATGTT 4 080 

TCGAAGTTTA CTGGTTTAAC CAATGCACTA ATCATCGTCA AAATGACTAG CATAATAAAA 4140 

CTTAAAATAG ACATAATAAC TACTGGCGTC G 4171 
(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

TGGTTGAAGT AGCAGTTAAT TCAAAATCTG CAACAGTTTC AGCAGAATaG GGG CTTTCAA 6 0 

AATAAt CAAA GGAGAATAAT TTATGACTAA AACTTTAAAG GTTTATAAAG GAGACGACGT 12 0 

CGTAGCTTCT GAACAAGGTG AAGGCAAAGT GTCAGTAACT TTATCTAATT TAGAAGCGGA 180 

TACAACTTAT CCAAAAGGTA CTTACCAAGT GGCATGGGAA GAAAATGGTA AAGAATCTAG 24 0 

TAAAGTTGAT GTACCTCAAT TCAAAACCAA TCCAATTCTA GTCTCAGGCG TATCATTTAC 300 

AC CAG AAA CT AAATCAATTA TGGTAAATAC CGATGACAAT GTTGAGCCAA ACATTGCACC 36 0 

AAGCACAGCA ACGAATAAAA TATTGAAATA TACAAGTGAA CATCCAGAAT TTGTTACTGT 42 0 

AGATGAAAAT ACAGGAGCAA TTCACGGTGT AGCTGAAGGT ACTTCAGTAA TCACTGCTAC 480 

GTCTACTGAT GGAAGCGATA AGTCAGGACA AATTTCAGTG ACAGTAACAA ACGGATAGGG 54 0 

ATTTAAGGCG CAGTATATCT GCGTCTTTTT TATTTGAATA AAAGGAGCTA ATACAATGAT 6 00 

TAAATTTGAA ATTAAAGATC GTAAAACAGG AAAAACAGAG AGCTATACAA AAGAAGATGT 6 60 

AACAATGGGC GAACAGAAAA ATGCTATGAG TATTTAGAAT TAGTAAATCA AGAGAATAAA 72 0 

AAAGAAGCAC CTAACGCAAC AAAAATGAGA CAAAAAGAGC GACAGTTATT AGTAGATTTA 7 80 

TTTAAAGATG AAGGATTGAC TGAAGAAGAT GTTCTGAACA AGATGAGTAC TAAAACTTAT 84 0 

ACAAAAGCCT TACAAGATAT ATTTCGAGAA ATCAATGGTG AAGATGAAGA AG ATT CAG AA 900 

ACTGAACCAG AAGAGATGGG AAAGACAGAA GAACAATCTC AATAAAAGAC ATTTTATCGA 960 

ACATTAAGAA AATACAACGT TTCTGTATGG AGCAGTATGG GTGGACATTA ACTGAAGTCA 1020 
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AAGAAAAACA AAGTGAACAA AAAGTCATTA CAGGTACGGA TTTAAGAAAA CTTTTTGGAA 114 0 

GCTAGAAAGG AGGTTAATAT GAATGAAAAA GTAGAAGGCA TGAC CTTGGA GCTGAAATTA 120 0 


GACCATTTAG GTGTCCAAGA AGGCATGAAA GGTTTAAAGC GACAATTAGG TGTTGTTAAT 126 0 

AGTGAAATGA AAGCTAATCT GTCAGCATTT GATAAGTCTG AAAAATCAAT GGAAAAATAT 132 0 

CAGGCGAGAA TTAAGGGGTT AAATGATAGG CTTAAAGTTC AAAAAAAGAT GTATTCTCAA 138 0 


GTAGAAGATG AGCTTAAACA AGTTAACGCT AATTACCAAA AAGCTAAATC CAGTGTAAAA 144 0 

GATGTTGAGA AAGCATATTT AAAGTTAGTA GAAGCCAATA AAAAAGAAAA ATTAGCTCTT 150 0 

GATAAATCTA AAGAAGCCTT AAAATCATCG AATACAGAAC TTAAAAAAGC TGAAAATCAA 156 0 


TATAAACGTA CAAATCAACG TAAACAAGAT GCGTATCAAA AACTTAAACA GTTGAGAGAT 16 2 0 

GCAGAACAAA AGCTTaAGAA TAGTAACCAA GCTACTACTG CACAACTAAA AAGAGCAAGT 168 0 

20 GACGCrtACA GAAGCAGTCC GCTAAGCATA AAGCACTTGT TGAACAATAT AAACAAGAAG 174 0 

GCAATCAAGT TCAAAAACTA AAAGTGCAAA ATGACAATCT TTCAAAATCA AATGATAAAA 180 0 

TTGAAAGTTC TTACGCTAAA ACTAATACTA AATTAAAGCA AACAGAAAAA GAATTTAATG 186 0 

25 ATTTAAACAA TACTATTAAG AATCATAGCG CTAATGTCGC AAAAGCTGAA ACAGCTGTTA 192 0 

ATAAAGAAAA AGCTGCTTTA AATAATTTGG AGCGTTCAAT AGATAAAGCT T CAT C CG AAA 19 80 

TGAAGACTTT TAACAAAGAA CAAATGATAG CTCAAAGTCA TTTCGGTAAA CTTGCAAGTC 2 04 0 

AAGCGGATGT CATGTCAAAG AAATTTAGTT CTATTGGAGA CAAAATGACT TCCCTGGGAC 2100 

GTACAATGAC GATGGGCGTA TCTACACCGA TTACTTTAGG TTTAGGTGCA GCATTAAAAA 2160 

CGAGTGCAGA CTTTGAAGGG CAAATGTCTC GAGTTGGAGC GATTGCACAA GCAAGCAGTA 2 220 


AAGACTTAAA AAGCATGTCT AATCAAGCGG TTGACTTAGG AGCTAAAACA AGTAAAAGTG 2280 

CTAACGaAGT TGCTAAAGGT ATGGAAGAAT TGGCAGCTTT AGGCTTTAAT GCCAAACAAA 2340 

CAATGGAGGC TATGCCAGGT GTTATCAGCG CAgcAGaAGC AAGTGGTGCA GAAATGGCTA 24 00 


CAACTGCAAC TGTAATGGCT TCAGCGATTA ACTCTTTCGG TTTAAAAGCA TCTGATGCAA 24 60 

ATCATGTTGC TGATTTACTT GCGAGATCAG CAAATGATAG TGCTGCAGAT ATTCAATATA 2520 

TGGGAGATGC ATTAAAATAT GCAGGTACTC CAGCAAAAGC ATTAGGAGTT TCAATAGAGG 25 80 


ACACTTCTGC AGCAATTGAA GTTTTATCTA ACTCAGGTTT AGAGGGGTCT CAAGCAGGTA 2 64 0 

CTGCATTAAG AGCTTCGTTT ATTAGGCTAG CTAATCCAAG TAAAAGTACA GCTAAGGAAA 2700 

TGAAAAAATT AGGTATTCAT TTGTCTGATG CTAAAGGTGA GTTTGTTGGA ATGGGCGAAT 2760

TGATTAGACA GTTCCAAGAT AACATGAAAG GCATGACGAG AGAACAAAAA TTAGCAACAG 2 820 
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CAGATAAAAT TAATAGCTAT AG CAAAT CAT TG AA3AACTC TAATGGTGAA AGTAAAAAAG 2 94 0 

CAGCTGATTT GATGAAAGAT AACCTCAAAG GTGCTCTGGA ACAATTAGGT GG CGCTTTTG 3 00 0 

5 AATCGTTAGC AATTGAAGTT GGTAAAGATT TAACGCCTAT GATTAGAGCA GGTGCGGAAG 3 06 0 

GATTAACAAA ATTAGTTGAT GGATTTACAC ATCTTCCTGG TTGGGTTAGA AAGGCTTCGG 312 0 

TAGGCTTAGC AATTTTTGGT GCATCTATTG GTCCTGCTGT TCTTGCTGGT GGCTTATTAA 3180 


TACGTGCAGT TGGGAGCGCG GCTAAAGGCT ATGCATCATT AAATAGACGC ATTGCTGAAA 324 0 

ATACAATTCT TTCTAATACC AATTCAAAAG CAATGAAATC TTTAGGTCTT CAAACATTAT 3 300 

TTCTTGGTTC TACAACAGGA AAAACGTCAA AAGGCTTTAA AGGATTAGCC GGAGCTATGT 3 36 0 


TGTTTAATTT AAAACCTATA AATGTTTTGA AAAATTCTGC AAAGCTAGCA ATTTTACCGT 34 20 

TCAAACTTTT GAAAAACGGT TTAGGATTAG CCGCAAAATC CTTATTTGCA GTAAGTGGAG 34 80 

20 GCGCAAGATT TGCTGGTGTA GCCTTAAAGT TTTTAACAGG ACCTATAGGT GCTACAATAA 3 54 0 

CTGCTATTAC AATTGCATAT AAAGTTTTTA AAACCGCATA TGATCGTGTG GAATGGTTCA 3 6 00 

GAAACGGTAT TAACGGTTTA GGAGAAACTA TAAAGTTTTT TGGTGGCAAA ATTATTGGCG 3 6 60 

25 GTGCTGTTAG GAAGCTAGGA GAGTTTAAAA ATTATCTTGG AAGTATAGGC AAAAGCTTCA 3 72 0 

AAGAAAAGTT TT CAAAGG AT ATGAAAGATG GTTATAAATC TTTGAGTGAC GATGACCTTC 3 7 80 

TGAAAGTAGG AGTCAACAAG TTTAAAGGAT TTATGCAAAC CATGGGCACA GCTTCTAAAA 3 84 0 

30 AAGCATCTGA TACTGTAAAA GTGTTGGGGA AAGGTGTTTC AAAAGAAACA GAAAAAGCTT 3 900 

TAGAAAAATA CGTACACTAT TCTGAAGAGA ACAACAGAAT CATGGAAAAA GTACGTTTAA 3960 

ACTCGGGTCA AATAACAGAA GACAAAGCAA AAAAACTTTT GAAAATTGAA GCGGATTTAT 4 020 


CTAATAACCT TATAGCTGAA ATAGAAAAAA GAAATAAAAA GGAACTCGAA AAAACTCAAG 4 080 

AACTTATTGA TAAGTATAGT GCATTCGATG AACAAGAAAA GCAAAACATT TTAACTAGAA 414 0 

CTAAAGAAAA AAATGACTTG CGAATTAAAA AAGAGCAAGA ACTCAATCAG AAAATCAAAG 4200 


AATTGAAAGA AAAAGCTTTA AGTGATGGTC AGATTTCAGA AAATGAAAGA AAAGAAATTG 4 260 

AAAAGCTTGA AAATCAAAGA CGTGACATCA CTGTTAAAGA ATTGAGTAAG ACTGAAAAAG 432 0 

AGCAAGAGCG TATTTTAGTA AGAATGCAAA GAAACAGAAA TGCTTATTCA ATAGACGAAG 4 3 80 


CGAGCAAAGC AATTAAAGAA GCAGAAAAAG CAAGAAAAGC AAGAAAAAAA GAAGTGGATA 444 0 

AGCAGTATGA AGATGATGTC ATTGCTATAA AAAATAACGT CAACCTTTCT AAGTCTGAAA 4 5 00 

50 AAGATAAATT GTTAGCTATT GCTGATCAAA GACATAAGGA TGAAGTAAGA AAGGCAAAAT 4 560 

CTAAAAAAGA TGCTGTAGTA GACGTTGTTA AAAAGCAAAA TAAAGATATT GATAAAGAAA 4 62 0 
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GTTGGTGGTC TAACTTTAGA GAAGACCAAA AGAAGAAAAG TGATAAATAC GCTAAAGAAC 4 74 0 

AAGAAGAAAC AGCTCGTAGA AACAGAGAAA ATATAAAGAA ATGGTTTGGA AATGCTTGGG 4 800 

5 ACGGCGTAAA AACTAAAACT GGTGAAGCCT TTAGTAAAAT GGGCAGAAAT GCTAATCATT 4 86 0 

TTGGCGGCGA AATGAAAAAA ATGTGGAGTG GAATCAAAGG AATTCCAAGC AAATTAAGTT 4 92 0 

CAAGTTGGAG CTCAGCCAAA AGTTCTGTAG GATATCACAC TAAGGCTATA GCTAATAGTA 4 98 0 


CTGGTAAATG GTTTGGAAAA GCTTGGCAAT CTGTTAAATC GACTACAGGA AGTATTTACA 504 0 

ATCAAACTAA G CAAAAGT AT TCAGATGCCT CAGATAAAGC TTGGGCGCAT TCAAAATCTA 510 0 

TTTGGAGAGG CACATCAAAA TGGTTTAGCA ACGCATATAA AAGTGCAAAG GG TTGGCTAA 516 0 


TAGATATGGC TAATAAATCG CGCTCGAAAT GGGATAATAT TTCTAGTACA GCATGGTCGA 5220 

ATGCAAAATC CGTTTGGAAA GGAACATCGA AATGGTTTAG TAACTCATAC AAATCTTTAA 5280 

AAGGTTGGAC TGGGGATATG TATTCAAGAG CCCACGATCG TTTTGATGCA ATTTCAAGTT 5340

CGGCATGGTC TAACGCTAAA TCAGTATTTA ATGGTTTTAG AAAATGGCTA TCAAAAACAT 54 00 

ATGATTGGAT TAGAGATATT GGTAAAGACA TGGGAAGAGC TGCGGCTGAT TTAGGTAAAA 54 6 0 

25 ATGTTGCTAA TAAAGCTATT GGCGGTTTGA ATAGCATGAT TGGCGGTATT AATAAAATAT 552 0 

CTAAAGCCAT TACTGATAAA AATCTCATCA AGCCAATACC TACATTGTCT ACTGGTACTT 5580 

TAG CAG GAAA GGGTGTAGCT ACCGATAATT CAGGAGCATT AACGCAACCG ACATTTGCTG 564 0 

30 TATTAAATGA TAGAGGTTCT GGAAACGCCC CAGGTGGTGG AGTTCAAGAA ATAATTCACA 5700 

GGGCTGACGG AACATTCCAT GCACCCCAAG GACGAGATGT GGTTGTTCCA CTAGGAGTTG 576 0 

GAGATAGTGT AATAAATGCC AATGACACTC TGAAGTTACA GCGGATGGGT GTTTTGCCAA 5 82 0 


AATTCCATGG TGG TACGAAA AAGAAAAAAT GGATGGAACA AGTTACTGAA AATCTTGGTA 5880 

AAAAAGCAGG GGACTTCGGT TCTAAAGCTA AAAACACAGC TCATAATATC AAAAAAGGTG 5 94 0 

CAGAAGAAAT GGTTGAAGCG GCAGG CG ATA AAATCAAAGA TGGTGCATCT TGGTTAGGCG 6 00 0 


ATAAAATCGG CGATGTGTGG GATTATGTAC AACATCCAGG GAAACTAGTA AATAAAGTAA 6 06 0 

TGTCAGGTTT AAATATTAAT TTTGGAGGCG GACTAACGCT ACAGTAAAAA TTGCTAAAGG 6120 

CGCGTACTCA TTGCTCAAAA AGAAATTAGT AGACAAAGTA AAATCGTGGT TTGAAGATTT 6180 


TGGTGGCGGA GGCGATGGAA GCTATCTATT TGACCATCCA ATTTGGCAAA GGTTTGGGAG 6 24 0 

TTACACAGGT GGACTTAACT TTAATGGCGG TCGTCACTAT GGTATCGACT TTGGTATGCC 6 300 

TACAGGAACG AACATTTATG CTGTTAAAGG CGGTATAGCT GATAAAGTAT GGACTGATTA 6360

CGGTGGCGGT AATTCTATAC AAATTAAGAC CGGTGCTAAC GAATGGAACT GGTATATGCA 64 2 0 
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ATCAGGTGCT ACAGGTAATT TCGTTAGAGG AG CACACTTA CATTT C CAAT TGATG CAAGG 6 54 0 

GTCGCATCCA GGGAATGATA CAGCTAAAGA TCCAGAAAAA TGGTTGAAGT CACTTAAAGG 6600 

TAGTGGCGTT CGAAGTGGTT CAGGTGTTAA TAAGGCTGCA TCTGCTTGGG CAGGCGATAT 6 660 

ACGTCGTGCA GCAAAACGAA TGGGTGTTAA TGTTACTTCG GCTGACGTAG GAAATATCAT 672 0 

TAGCTTGATT CAACACGAAT CAGGAGGAAA TGCAGGTATA ACTCAATCTA GTTCGCTTAG 6 78 0 

AGACATCAAC GTTTTACAGG GCAATCCAGC AAAAGGATTG CTTCAATATA TCCCACAAAC 6 84 0 

ATTTAGACAT TATGCTGTTA GAGGTCACAA CAATATATAT AGTGGTTACG ATCAGTTATT 6 90 0 

AGCGTTCTTT AACAACAGAT ATTGGCGCTC ACAGTTTAAC CCAAGAGGTG GTTGGTCTCC 6 960 

AAGTGGTCCA AGAAGATATG CGAATGGTGG TTTGATTACA AAGCATCAAC TTGCTGAAGT 702 0 

GGGTGAAGGA GATAAACAGG AGATGGTTAT CCCTTTAACT AGACGTAAAC GAGCAATTCA 708 0 

ATTAACTGAA CAGGTTATGC GCATCATCGG TATGGATGGC AAGCCAAATA ACATCACTGT 714 0 

AAATAATGAT ACTTCTACAG TTGAAAAATT GTTGAAACAA ATTGTTATGT TAAGTGATAA 72 00 

AGGAAATAAA TTAACAGATG CGTTGATTCA AACTGTTTCT TCTCAGGATA ATAACTTAGG 726 0 

TTCTAATGAT GCAATTAGAG GTTTAGAAAA AATATTGTCA AAACAAAGTG GGCATAGAGC 732 0 

AAATGCAAAT AATT AT AT G G GAGGTTTGAC TAATTAATGC AATCTTTTGT AAAAATCATA 73 80 

GATGGTTACA AGGAAGAAGT AATAACAGAT TTTAATCAGC TTATATTTTT AGATGCAAGG 74 4 0 

GCTGAAAGTC CAAACACCAA TGATAACAGT GTAACTATTA ACGGAGTAGA TGGTATTTTA 7 50 0 

CCGGGCGCAA TTAGTTTTGC GCCTTTTTCA TTAGTATTAA GGTTTGGCTA TGATGGTATA 7 560 

GATGTTATAG ATTTAAATTT ATTTGAGCAT TGGTTTAGAT CTGTGTTTAA TCGCAGACAT 7620 

CCTTATTATG TTATTACTTC TCAAATGCCT GGTGTTAAAT ATGCAGTGAA TACAGCTAAT 7680 

GTTACATCTA ATTTAAAAGA TGGTTCTTCA ACTGAAATTG AAGTAAG TTT AAATGTTTAT 774 0 

AAAGGGTATT CTGAATCAGT TAATTGGACC GATAGCGAGT TCTTATTCGA CTCTAATTGG 7800

ATGTTTGAAA ATGGAATTCC TCTTGATTTC ACACCTAAAT ATACTCATAC ATCAAATCAA 7860 

TTTACTATTT GGAACGGTTC TACTGATACG ATAAATCCAC GATTCAAGCA CGATTTGAAA 7 920 

ATATTAATTA ATTTAAATGC GAGTGGAGGA TTTGAACTGG TTAATTATAC AACAGGTGAT 7980

ATTTTTAAGT ACAACAAAAG TATAGATAAA AACACTGATT TTGTTTTAGA TGGTGTGTAT 8 04 0 

GCATATCGAG ATATAAACAG AGTGGGAATT GATACAAATA GAGGCATTAT AACATTAGCG 8100 

CCAGGTAAAA ATGAATTTAA GATTaAAGGA GACGTCAGTG ATATTAAAAC TACATTTAAG 816 0 

TTTCCTTTTA TTTATAGGTA GGTGATTTAA TGGATTATCA TGATCATTTA TCAGTAATGG 822 0 
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w 



ATTATGAACT GAATGAAGCT AGGTACATCA CCTTTACAGT TTATAGAACT ACT CAT AAT A 834 0 

GTTTTGTTTT TGATTTATTG ATTTGTGAAA ACTTCATAAT TTATCATGGT GAAAAATACA 84 00 

CAATTAAGCA GACAGCGCCA AAGGTTGAAG GTGATAAAGT TTTTATTGAA GTTACGGCAT 84 6 0 

ATCACATAAT GTATGAATTT CAAAATCACT CAGTGGAATC AAATAAGCTT GATGACGACA 8520 

GTAGCGAAAC TGGTAAAACG CCAGAATACT CTTTAGATGA GTACTTAAGA TATGGATTTG 858 0 

CAAATCAAAA AACGTCAGTC AAGATGACCT ATAAAATAAT TGGAGATTTT AAAAGAAAAA 864 0 

TACCAATTGA TGAATTAGGT AATAAAAATG GCTTAGAATA TTGTAAAGAA GCAGTAGATT 8 70 0 

TGTTTGGTTG TATTATTTAT CCAAATGATA CGGAGATATG TTTTTATTCT CCTGAAACAT 8760 


TCTATCAAAG AAGCGAAAAA GTAATAAGGT ATCAATATAA TACTGATACT GTGTCTGCTA 882 0 

CTGTCAGTAC GTTGGAATTA AGAACAGCTA TAAAAGTTTT TGGGAAAAAG TACACAGCCG 888 0 

20 AGGAAAAGAA AAATTATAAT CCTATTAGAA CAACTGACAT TAAATATTCA AATGGTTTTA 8 94 0 

TAAAAGAAGG TACTTATCGT ACCGCAACAA TTGGGTCTAA AGCTACTATT AACTTTGATT 900 0 

GCAAGTATGG TAATGAAACA GTTAGATTTA CAATAAAaAA GGGCTCTCaA GGTGGAATAT 906 0 

25 ATAAGTTGAT TTTAGACGGC AAG CaAATTA AGCaAATTTC TTGTTTTGCT AAGTCGGTTC 912 0 

AGTCTGAmAC AATAGATTTA ATA aAAAATA TTGATAAAGG CAAGCACGTT TTAGAAATGA 918 0 

TATTTTTrGG AGArGrCCCC AAAAATAGAA TTGATATATC TTCAAATAAA AAAGCTAAGC 924 0 


CTTGTATGTA TGTTGGAACT GAAAAATCAA CAGTCTTAAA TTTAATTGCT GACAACTCAG 93 00 

GTCGCAATCA ATACAAAGCA ATTGTTGaCT ACGTCGCAGA TAGTGCAAAG CAG TTTGGG A 93 6 0 

TTCGATATGC TAATACGCAA ACAAATGAAG ATATCGAAAC ACAGGATAAG CTGTTAGAAT 94 2 0 


TTGCAAAAAA GCAAATAAAT GATACTCCTA AGACTGAATT AGATGTTAAT T AT AT AGG TT 94 8 0 

ATGAAAAAAT AG AG CCAAG A GATAGCGTAT TCTTTGTTCA TGAATTAATG GGATATAACA 954 0 

40 CTGAATTAAA GGTTGTTAAA CTTGATAGGT CACATCCATT TGTAAACGCA ATAGATGAAG 960 0 

TGTCTTTCAG CAATGAAATA AAGGATATGG TACAAATTCA ACAAGCGCTT AACAGACGAG 966 0 

TTATTGCACA AGATAATAGA TATAACTATC AAGCAAATCG TATAAATCAT TTATACACTA 972 0 

45 GTACTTTGAA TTCTCCTTTC GAGACAATGG ATATAGGGAG TGTATTAATA TAATGGCAAC 9780 

AG AAG AAG TT AAAATCAAAG CGCTACTTGA AAACGATAAA C 9821 
(2) INFORMATION FOR SEQ ID NO: 471: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:

TTATTGTTTT CCAAATGGTC TTCTCGGTTA TTTAACCATC GGTTTCAATC CTACGTGCTT 6 0 


CACTTTCTAT CtATTAATTC AaTTTCAGCT TGACCACCCG CTGTATAAAG GGTCAAAGTT 120 

GCTAATCGAT AGCGTCTCAT TATAGGACCA ACATCGATAT CAATATTTTG AATACGAAAA 180 

TATGGTATTA CCTTTTCATC CAAAAATAAA ATGCCGTTTC GTACACGCAA ATGGTGTTTT 240

TCAAATGCAT ATCTGCAGTG CTTATATCGA TAGACTGGCG CTATAACAAG CGTGAAAACA 300 

GCAACAAGTA ATATTATAAT CACACTACTA ACAATGGATA AATGGTTATC TAAAATCTCC 360 

15 CAAAATAGCC AGTTCAAAAT ATTAAATGCG ATTAAAAGTA CAAGCGCTAT GGGTATCCAA 42 0 

AACAGCACAG CACTTAACCT CATCACTTTT TTAGCGTGTG GTGACATAAA ATTATAATCC 4 80 

CTCATCATTT TCACCTCTTA AATAC CATGA TTTCAATTTA TTTGCATCTT CACTTCTTGC 54 0 


GTATTTTAAG TTAATCGATT GGGCTCCCAC ACCTTTAGCA ACAATAAAGC TAAAATTATT 6 00 

TAAATTGTTG CGTTTAAGTA ATGTATGTTG CCAAGTGTCA AATCCTATAA TGTGATGCGC 660 

TTTAAAATAA TAAATATTTC GTTTCAATAG CTCGAAATTC TGGATAGTAA TTTGTTCTTC 72 0 


TGTCATTTTA AAACCCGCAT GTTTGACATA AAGATATCCT TTGATCACAA ATAAACCAAT 780 

AATGACTATT GTTATAATCG TAAATAACAA TAATAATTGA TTCCAAAAAT AACAGCCTAT 84 0 

ACCTGCCATA GCTATGACAA TAATACTAGG TATTAAAAAG TGTCTGTGGA AACCTGACAA 900

AGGCATACCT TCATTAACTT GTTGATAAGA TAAATCTGGT ACTAAATTCT GGATAATTTG 96 0 

ATATGCTTTG TCTCGTTTAA TAAACGGCaA TATCGGCACA CTACCTGAAT CATTGTC 1017 

(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6806 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:

TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 6 0 

GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 12 0 

TTATAGCAAA TCGGATAGTG ATTTTTAATC ATAGACACAA AATCATCTTG TGCATGTATA 180

TATTCTTTTC TTCTTAATCG GCGAATTAAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 24 0 
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TTAATAACAT TATTnGATCA AGGTCATCTC ATGCATAGAT AAATCTTCTG TATTaGATGC 360 

AATATGTAAT GCAATAAATC CTATCTCATC TTCAGGaAAA TGTaCATCTA ATGCTGCATT 420 

5 TAACTGATCA ATCACCTGTT TAGCAATATG ATATGCATCA CTATATAACT GCATAGTTTC 4 80 

CATAACAAAT GGATTGCTAA TAACTTGATT TTGTTTTAAG CGTTTATAAG CAAATATAAT 54 0 

ATGATCCGTT AATGAAACTA CAAGTTGTTT TGAATCAACA TTCATCGCAG TATTAGAAAT 600 


AAAATTCAAC GAATCAATAA TTACTTGTAA TACATTATCA TCAGCGATTT CAACTAAACT 660 

TTTATAATGT GCTTTTTGTT GCTCACTTTC TAATTTATAA ATTTT CTCT A TTGTAATAGT 72 0 

TTGGTCGTTT AACGCCATTC CCTCTTTTTT GTTAAAACCA ATACCTTTAC CGATTAAAAT 780 


AACTTCTTGA TCATTATTAG TACATACTAC GACATTGTTG TTCAATGTTT TAGTAACAAT 84 0 

ATATTCTCCC ATTAT CAT CA CCTATTTTTC TATTATTAAG ATTATATATC GGAAATGTCT 900 

AACTTGAAAG TATAATAATT TAAATACTTG TGTTCTATTA CACGCCTATC CTATATGATA 960

TATCTTAATT TAATTTCGAT GTCTCTCAAA GTGGAATAAC TATAATAAAA AAATCTGACT 102 0 

CCCATATTTA CAAATAATTC TATTTATTTA TATATTATCT GAATTAATAC TCAATACAAA 108 0 

25 CTAAAACGTA CTATTAAATT GTGCAAAGCT AAAACAAATT TATATTCATC TATCCAACAA 114 0 

TATGTCTTAT CAATGGTATA GTCTTTGCAC ACCAATGGAG GAAATAAATC TCAACCTTAC 120 0 

TAT ATT AA T A TATAATCAAA TCTTAGATTA ACT AG TGTAA TGATACAGAT GATAATTGAG 126 0 

30 TACAAATTTA AAACCCTGAG ATTTTCGCTT TAATTTGAAA AC CTCAGGGT TTATTTGATT 1320 

TTTATATAAT GAATCGTTAC ATTAAAAATA TTTATTTATC AGAGTTCTTA TATTTGTTAG 1380 

CGCCCCAAGC ACTAATTCCA AATAAGTTAA TTTCTAAGTT TTCAGGTTTA AAGACAGGGT 144 0 


TCTTGCCTTC TTTTTTCTGC TTTTGATAAT CTTTCATCAA TGCAAAAGCT ACATTGGACA 1500 

GTCCTATAAT GGAAATAATG TTTACAATTG CCATTAAGCC CATAAATAAG TCTGCCGTAT 156 0 

TCCATACTGT TTCTGTTTTT ACAACTGCAC CGACAAAGAC AAGTACTACA ACAAGACATC 162 0 


TAAAGATAAA TAATATTACA CGGTTTGTTG ATAAAAATTC AATATTAGAT TGACCGTAAT 168 0 

AGTAATTACC TACAACAGAT GAAAATGCAA ACAGTGTAAC tGCTaTTGTT AAGAAAATAC 174 0 

45 CTCCAGCAGA ACCTAAATGC TCATTAAGTG CTGATTGAGT AACTGCAACA CCTTGAGGTG 180 0 

CGTTATCACC AAATTTCAGT CCTGAATATA GTAAAATCAT GATTGCAGTT GCTGTACAAA 186 0 

CCAACATTGT ATCAAAGAAC ACACCTAATG ATTGGATTAA ACCTTGCTTA ACAGGGTGTG 192 0 

50 GTACGGCAGC AGTTGCCGCT GCATTCGGCG CAGAACCCAT ACCAGCTTCG TTAGAGAATA 198 0 

AACCACGTTT GATACCTTGA AGAACCGCAG CACCTACAGC GCCACCAGTT ACTTGTTCGA 2 04 0 
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GCAATATTAC TAAAAC CAT A CCAATGTAAA TGATAGCCAT AATCGGTACA ATTAACGAAG 2160 

ATAACGTAGC AATACTACGT ACACCACCAA ATATAATAAT AGCTGTTACG ATTGCTAAAA 22 2 0 

5 TAATACCTGT GATTACTGGA CTAATATTAT ATTGCGTATT TAACGACTCC GCAATTGTAT 228 0 

TAGATTGCAC TGTGTTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 2 34 0 

TACCTAGCCA TTTTTGATTT AAACCTTTAG TAATATAGTA AGCTGGACCA CCACGGAATC 24 0 0 

10 CACCATCTTT ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 24 6 0 

CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 2 52 0 

TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTCT CGAACCAGCA CTAATCGCAA 2 580 


ATGCTTGGAA TGGCGAAATA CCCTTCTTAC CATCTTCTAA AGTTTCTGGA CGTTCTACTA 2 64 0 

AAGCTCTAAA CATTTCAGGT AACATTCGTA ATTGAACGAA TTTAGAACTA ATCGTAAAGA 2700 

AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA C CAT ATT AAA TCGGTACCAA 276 0 


CATGGACAAA TTCTTTAAAC CATCCAGGTA TTAAACTATC GAAATCTTTC AAAATAAACC 2 82 0 

CCTCGCATCC TCTACATGAA TCATGTACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 2 8 80 

25 GTAAATATAG AGA7ACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGTTTGGT 2 94 0 

CAAAGTATAT CGCTAATTTA ACGATAAGTA CTTGGTCAGC ATTTAATATA AATCCCTTGA 3 000 

ATTTAGTCAA AATTTAA CAT TACTGTATTT TATCATTTAA TTTCGTGATT GCATATAGTT 3 060 

30 TTTAGCTAAT ATACATGTCT ATTACTTCAC CAAAATCATC TGTATCTACA ATGAATGAGC 312 0 

CATTTGTATA TTGTTCAGAT TTATGAATAT CATTAATTAA ACCATGTTCT TCATTTGATT 3180 

TTGAATATAA TGTATATTGA CTATGTTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 324 0 

35 GTGGATTTTT CTTTAATTCT TTTAATAAAG TTATTCCaCG TTGTGCTCTT TTAGCAACTT 3300 

GTAAGATTTT AAAACTAATA CGTTTTAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 3360 

CATTTTCAGA AACACCTTCT GTCATAACAA CGAAATCTTC AG CTTT AAG A TTTATTGATT 3420 


TAACACCAGC TGCCCTTAAT CCGGTATCTG ATAGTTCACT TGTATTATAC GTTAATGACA 34 80 

TACCTTTATT AGTAATGACG GTAATTAATT GATCTTTTTC AAAGCGCATA ACACTAATCA 354 0 

AATCATCATT TTCTTTAACT TTAGTAGCAA TTAAAGGTTT ATT AAAACG C GTTGTTTTAA 3 60 0 


ATAGAGGCAC TGTACTTTTC TTAATCATGC CATTTTGAGT CGCAAAAACA TAAAATGCAT 366 0 

CTGTATTAAA GTCCTTTTCA TTAAAGACAT TAATAACCAC TTCATCTTCT TCGATAGGAA 3720 

50 CTATTTGTGA TACATGTTGT CCCAATTCTT TCCAACGAAT ATCTGCTAAT TTATGAACCG 3 780 

GTATAAATAG ATAACGACCT TTATTTGTAA AT ACT AG T AC GGTATCTTGC GTATTTACTT 3 84 0 
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TAAAGCTACG AATAGAAGTA CGTTTAATAT ATCCATGACG TGTCATACTT AAAATAACTy 3 96 0 

CTTCACTAGG CACCATAACT TCTTTGTCAA TTTTAATTTC TTCAATTTCT GCTTCAATTA 4 020 

AAGACAGTCG TTCAGATTTG AATTTCTTTT TAATTTCATT CAATTCTTCT TTTATGACAT 4 08 0 

TCAATAATGC ATCATGGTTA TCAAGAATAT GACGTAATTG TTTGATTAAT GCTTCAAGTT 414 0 

CTTTATGTTC ACCTTCAAGC GCAACTATGT CAGTATTTGT TAAACGATAT AACTGTAACA 420 0 

TTACAATTGC TTCAGCCTGT TCTTCTGTGA ACTCGTATAC TTCGATAAGG TTTTCTTTAG 4260 

CGTCACGCTT GTTTTTAGAG CwACGAATCA ATTCGATTAC TTTAT CTAAA ATTGACAACG 432 0 

CTTTAATCAA ACCTTCAACG ATATGCATAC GTTTTTCTGC ATTATCTAAT TCAAACTTCG 4 3 80 

TTCTATTTGC AACAACCTCA ATTTGGTGAT TCAAATAACT ATCTATAATT TGACGAATAC 444 0 

C CAT CAATTT TGGACGACCA TCACTAATAG CGACCATGTT GAAATTATAT GAAATCTGTA 4 500 

AATCAGAGTT TTTATAAAGA TAATTTTTGA TTGATTCACT GTTCACATCT TTTTTCAATT 4 560 

CAATTGCTAT TCGTAAACCA GTTCTATCAG TTTCATCACG TACTTCAACG ATACCATCGA 4 62 0 

CTTTTTTGTC AGCACGTAAT TCATCGATAC GTTTTACTAA GCTACTTTTG TTCACTTCAT 4 68 0 

ATGGAATTTC AGTAATAATT AACTGTTTAC GTCCATTGCG TAAAGTTTCT TCTTCAACTT 4 74 0 

TAGAACGAAC T AT AATTC T A CCTTTACCTG ATTCATAAGC TTTTTTAATA C C AT CAAT AC 4 80 0 

CTTGAATAAT ACCACCAGTT GGAAAATCAG GACCTTTAAT ATATTTCATT AATTGATTGA 4 86 0 

CTGTAATATC CGGATTATCA ATATATTTAA GTGTTGCTTG AATCACTTCA GCTAAATTAT 4 92 0 

GTGGTGGTAT ATCTGTCGCG TAACCTGCAG ATATACCTGT AGAACCATTC ACTAGTAAGT 4 98 0 

TAGGAAATCT TGATGGCAAT A C CATTGGTT CGAGTGTCGT ATCATCATAG TTTGGAATGA 504 0 

AAGAAACTGT CTCTTTATTA ATATCACGTA ATAACTCTTC AGCTAGTAAG CTTAACTTAG 510 0 

CTTCAGTGTA ACGCATTGCC GCTGGCGGAT CATTATCGAT ACTACCATTA TTACCATGCA 516 0 

TTTCTATTAA GACATGTCGT AACTTC CAGT CTTGACTTAA ACGGACCATT GCTTCGTACA 522 0 

CTGAGGAGTC TCCATGTGGA TGATATTGAC CAAT AA CATC ACCGACTGTT TTCGCACTTT 52 8 0 

TACGGAAATT TTTATCGTGT GTATTACCAC TTGAATACAT TGCATATAAA ATACGACGTT 534 0 

GTACTGGTTT TAAACCATCA CGAACATCTG GCAATGCACG CTCTTGAATA ATATATTTAC 54 0 0 

TATATCTTCC AAAGCGATCA CCTAAAACAT CTTCAAGTGA TAAATCTTGA ATTATTTCAC 54 6 0 

TCACTAGATT TCCTCCTCAT CAAATTGATC ATTTTCAAGC ACTTGTACTT CAGAATTATC 5 52 0 

TAAAATA CTT TGGTCCTCTT GCATACCAAA CTCAACATGC TTTTCAATCC ATTCACGTCT 55 8 0 

AGGTTGTACT TTGTCACCCA TTAATGTTGT TACACGTTTA GATGAACGCA CTTCAT CTTC 564 0 
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AGGGTTCATT TCACCCAAAC CTTTGTAACG TTGTAACGTG AaGCCTTTAC CAAGTTCTTT 5 76 0 

TTGCAATTTA TTAAGCTCTT CGTCTGTCCA AGCGTATTCA ACTCGCTTTG TTTTGCCTTT 5 820 

ACCTTTTTCC AATTTATAAA GTGGAGGTAA AGCAATAAAT ACACGACCTG CTTGAACAAG 5880 

CGG TTT CAT A TATTTGAAGA AGAATGTTAA CAATAGCACT TGAATATGCG CACCATCAGT 5 94 0 

ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 6 000 

ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCTTCATTTT TAAAAATATC 6 060 

TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TCGCTTGGAA 6120 

TTTGCGGTCT CGTcCAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 6180 

TTCATTTTTT TCAGTGTTTT TACTTTGTGC AGGTGTTAAT TTACCAGATA GCAAAGTGTC 624 0 

TTTACGCTTG TTTTTCTTAC CTGAACGAGC ATCTTCACGA GCTTTACGTG CAGCTTCCCT 6300 

TGCTTGTTGT GCTTTAATCG CTTTTTTCAC AAGTGATTTA GACAATTGTC CTTTTTCTTC 6360 

TAAATAGAAT GGCAATTTGT CTGCAACAAC TGAATCAACA GCACTTCTAG CTTCAGAAGT 6420 

ACCCAATTTA GATTTCGTTT GTCCTTCAAA TTGCAATAAT TCTTCTGGAA TACGAACAGA 64 8 0 

CACAACAGCT GTTAAACCTT CACGAATATC ATTACCATCT AAGTTTTTAT CTTTTGTTTT 6 54 0 

AAGTTCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG TTTTAAAACC 6 600 

AACTTCATGT GTACCACCAT CTTTAGTACG TACATTATTT ACAAAACTTA AAATACTTTC 6 66 0 

TGAATATTGA TCATTATATT GGAAAGCTAC GTCTACCTCT ATACCATTTG CTTCACCTGA 6 720 

AAATGTAGCC ACGTCATGCA AAACTTCTTT TCCTTCATTG ACATAACTAA CAAACTCTTT 6780 

G ATTCCTT CT TATAATGGTA TGTCTT 6 8 06 
(2) INFORMATION FOR SEQ ID NO: 473: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1716 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

GGGGCAnAAA TTCCAATACA CTCATTACCA AATATATACA CCACTTCCTG CTACAAGTnn 6 0 

TTTTACTTGA TCTTGGTCTT TTCCGCAGAA AGAGCATTTC CAAATTTTCT TCATCTTCCA 120 

TTGAATTTAA AC ATT CTTTT TACACCCCTA TTCGTTAAAG ACTATACTAG ATTGGATGTT 180 

ACAATGCAAC ATATTAACAT ACAAACTTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 24 0 
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CCtTCAACGA ACTTTGCGTT ATCTCTTAAT AAATCGATAA CTTTTTGGAT ACGAACATCA 36 0 

TTTTTAATGA TATCAGTATT ACCTAAAGTA TTTTTGATAT CTTCAACTGA GATATTAAAT 4 20 

TGTTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC GATTTTTTCA 480 

GCTTCAG CG A TCGCAGTTAA AGTTAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 54 0 

ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 600 

AAACCTTGTT GTTGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA 660 

ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAG CTTTTGTAAT CGCTTCTTCT 720 

TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGTTTACG TAAGTTTTCT 78 0 

TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAG CAATTT C ATCTG TTAAT 84 0 

TCTGGGACTT CTTTAAATTT AATTTCGTTA ACTTTTGTTT TGAAAGTTGC TTCTTTACCG 900 

GCTAATTCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 96 0 

ACTTTCATAC CTTCTAATTG CTCTTCGAAA C CAGGT AT G A ATGAACCTGA ACCGATTTCT 1020 

AAATCGTAAC CTTCAGCTTG TCCACCTTCG AATTCTTCtC CGTCAACTGA ACCACTAAAG 108 0 

25 TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT CTTTAACGAC CATTTCAGCT 114 0 

AAATGTCCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATC AG AT AA TTCAGTTTCT 1200 

TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 12 6 0 

30 GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTGAG TAACACTTAC TTCtGGTTGT 1320 

GCAACTGGTT TAATATCAGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 13 80 

TCG AT AG CAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGGC 144 0 

35 ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTTTAA CCACTTTTTT GAATGCTTGA 150 0 

TCTAACGCTT TGTTTACTTT TTCTGCAGGA ACAGTAACAG TTAAT AAA C C TTCGTTACCT 156 0 

TCCTTTTTTT CCCAAGTTGC TGTCATGTAT ATATACCTCC ATGATTAACT AATTTATTTT 162 0 

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTGAAA TCACAACGTT 16 8 0 

TATATATTTG TAAATCAACT TTTTTCGTCA AAACTA 1716 
(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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TGACCAAGTA CGTTTCGAAG TTGCCATTAA AGCATTAAAC CCATCATTGA AAGCATTCGC 6 0 

ACCTGTACGT GAGTGGGCAT GGAGTCGTGA AGAAGAAATC GATTATGCAA IT AAA CAT AA 12 0 

TATCCCTGTA TCAATCAACC ATGATTCACC TTATTCTATC GATCAAAATC TATGGGGCAG 180 

AGCGAATGAA TGTGGTATTT TAGAAGATCC TTATGCTGCG CCACCAGAGG ATGCGTTCGA 24 0 

TCTAACAAAT GCTTTAGAAG AAACACCAGA TACTGCTGAT GrAATCATTT TAACGTTTGA 300 

TAAAGGCATC CCAGTTCAAA TTGATGGCAA AACATATGAA TTAGACGATT TAATTTTAAC 36 0 

GTTGAATGCA TTAGCTGGTA AGCATGGTAT CGG AAGAATT GACCATGTAG AAAATAGACT 4 20 

TGTAGGTATC AAATCAAGAG AAATTTATGA GGCACCTGCT GCAGAAGTTA TTTTAAAAGC 4 80 

GCATAAAGCA TTAGAAACGA TTACGTTAAC GAAAGATGTC G CACACTTT A AACCAATCAT 54 0 

TGAGAAGCAA TTTGCTGAAC AACTATACAA TGGACTTTGG TTCTCACCTT TAACTGATAG 600 

CTTGAAATTA TTTATTGATA GTACTCAGCA ATACGTAAGT GGTGATGTCA GAATTAAATT 66 0 

ATTCAAAGGT AATGCCATCG TGAATGGTAG AAAATCACCT TACACATTAT ATGATGAAAA 720 

ATTAGCAACT TATACAAAAG AAGATGCATT TnATCAAGAC GCTGCTGTTG GCTTTATCGA 78 0 

TATCTATGGT TTACC 7 95 
(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 6 0 

GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATGATG r GaAATATTAT 12 0 

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 180 

GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 24 0 

ATGTGGGAAT GCCCAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 3 00 

CAAGGTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 3 60 

GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 420 

GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTA CTAATT 4 80 

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 54 0 
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G CGTTGG AAA AATTACGTCA CAATAAAGAG ACAGCATTt A GGc tACGCAA ATAAATTTAC 6 60 

TCGAAAATTA CATCCGTATG AAGGTAAACA GTATGAATTA AT CAT AG ATA TTTTGGATAA 720 

5 TGATGCTACC GAAGTGTACT TTGAATTACG TACATCTAAG ACTTCTTCAA CATTAATTGC 7 80 

TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCCGAC 84 0 

AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 887 



(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476: 

TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTCCGATGG ACCTTTTAAA 6 0 

ACnACCAAAT AGAAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 12 0 

25 AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGGTTTTTT AACTGAAAAT 180 

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 24 0 

AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 30 0 

30 AAAGAATTTT ACATGCTTTA GAAGAAAAAT TGAAACAAGG TCTAGAAGTA AAAATATTAT 36 0 

ATGATGATGT TGGATCTAAA AATGTTAAGA TGG CAAATTT TGATCATTTT AAATCGTTAG 420 

GTGGAGAAGT TGAAGCATTT TTTGCTTCAA AATTACCGTT ATTGAATTTC AGAATGAATA 4 80 

35 ATAGAAATCA T AG AAAAAT C ATCGTAATCG ATGGT CAACT AGGTTATGTC GGAGGATTTA 54 0 

ACATTGGTGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 6 00 

GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 66 0 

AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGACCAT 720 

TGGGCAATTC ACCAATTCAA ATAGCTGCAA GTGGCCCGGC TAGTGACTGG CATCAAATTG 7 80 

AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 84 0 

ATTTCATTCC GGATAATTCA TATATaAATG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 900 

ATGTACATTT AATGATTCCA TGTAAGCCAG AT CAT C CATT AGTATATTGG GCGACATTTT 960 

50 CAAATGCCTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 102 0 

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 10 80 
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CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC 1183 
(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 

GGTTATATGT TTGGTATTAC TCATTATATT GATTGGAGGT TGTGTCATTA TGACAAAAAC 6 0 

AAATGGTCGA AACGCTCAAA TTAAAGAAAA TTTCAACAAA ACATTAAGTG TATATCTAAC 12 0 

CAAGAATCTC GATGATTTTT AC G AT AAGG A AGGTTTTCGA GATCAAGAAT TTGATAAAAG 180 

AGATAAAGGG ACTTGGATTA TTTATTCTGA AATGGTTATC GAACCAAAAG GGAAnAATAT 24 0 

GGAATCGAGA GGAATGGTGC TCTATATCAa TCGCAATACT AGAACmACGA AGGGTAATTT 3 00 

TATTGTCACC GAAATAACTG AAGATAGTAA AGGATATTCA CGTAGTAAAG AAAAAAAATA 360 

25 TCCTGTCAAG A TGG AAAAT A ATCGAATTAT TCCAACAAAG CCTATACCGG ATGACAAGTT 4 20 

AAAAAAAGAG ATTGAAAACT TTAAGTTCTT TGTACAATAT GGAAATTTTA AAGATTTTAA 4 80 

AGATTATAAA AATGG TG AT A TTTCATATAA TCCTAATGTG CCAAGTTATT CTGCAAAGTA 54 0 

30 TCAATTGAAT AATGATGACT ATAATGTTCA ACAGTTAAGA AAACGATATC ATATTCCAAC 60 0 

CAAACAAGCG CCCGAATTAA AATTGAAAGG ATCCGGCAAT TTAAAAGGCT CATCCGTAGG 660 

ATCTAAGGAT CTAGAATTTA CGTTTGTAGA AAATCAAGAA GAGAATATCT ATTTTTCAGA 72 0 

35 TTCGGTCGAA TTTACACCTA GCGAGGATGA TAAATCATGA GTCAAACGGA ATATCAAATA 78 0 

AAATCTGGCA ATATAAAAGG TAACTCTGAA GAAACAAGTA CAGTATCTAA TATAAGTTAT 84 0 

GAAATAGAAA ACGCAAATAA CAGTGGTTTA AAACAAAATA AAATTGATAA ACAAATTAAA 90 0 

AAGTTACAAG AAAAAAATAA ATTCCCTAAA AATCTTTCAT ATCTTAAAAG TTATACGGAC 96 0 

CCCAAAACAG GCACGACTAC AAGCGCCTTT TTAAATAAAG ACACTGGCAA AGTTACTTTA 102 0 

GGTATGACAG GTACTAATGT ACACAAAGAC GCAATATTAA AACAAACATT TGGTGTTCCT 108 0 

TCTTATCAAG GATATATAGA CGTGAGTGAA ACgCTaAAAG ATATTGGGGC CGATGTCAAT 114 0 

ATTGGCCTTC ATTCCGTCAC AGATAAAGAT CCACATTATA AAAATACCCA AGACTTTATC 1200 

50 AAAAATATCA AAAAAG AC TA TGATATTGAT ATTATTACCG GACATTCGCT GGGCGGTAGA 12 6 0 

GATGCGATGA TTTTAGGTAT GAGTAATGAT ATTAAACATA TCGTTGTGTA TAATCCAGCT 132 0 
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ATTGAAAAGT ACGATGGTCA CATTG TAAG A TTTGTGTCTG ATGAAGACGA ATTAGATGCA 144 0 

GGTGTCCGCA ATCATTTATA TGAAACTGCT GGAGAAAAAA TAGTACTTAA AAATGGAGAA 1500 

5 GGCCATGCAA TGAGTGGTAT TTTAATGAGC AGAACACAGG CTATAATCTT AG CTG AATT A 156 0 

AACAAAGTTA AAGGCTACCA AGACGAAAAT AATAAAGCAT TAAAATCCGT TCGTAAACAA 162 0 

ACGAGGCATA GATTACATAA AGTAGAGACG TTAAGAGCGA ATTGGATTCA AACAACGGGT 1680 

10 GGATCACTCT CTTCCTCCCw ACAACAATTA TTAGAAGCTT TAACAGCACT AACCATTGCC 1740 

GAAGGCTTAA ATCAATTAGT GAATGAAGAA AGCCAACATT TGAAAAAATG TATCACGCGA 1800 

TGGCACATAA ATTTGGAGAC AACTGGAAAA AAGCGCAAGA AGTTGGAAAT GAAATTGGTG 1860 

AAAAATTAAC CTCTGAAGAG GTTATAGATG rATTAAGAAA AGGTGGCGCG TATGAAAGTa 1920 

AACTTGAAAC AGATCCCAAA AGAAAAATTG ATGATAAGAT AAAGAAATTA AATGATGTTT 1980 

ATAAAAATTG TAATGGCTAT ATCGCAAAAA TTAAACAGAG TATCGAAGCA ATTGTTTCTA 204 0 

ATGACCAAAT GTTAGCGAGC CAGATTGATG GGATGATGTA ATGTTTACTA CGTATAAnAA 2100 

TATTAATGAA CTTGAAAATG CCTATGATGA AGAAAGAAAA CAATTGAATG ATGCATTCAA 2160 

25 TCAAATTGAT GAATTAAGAC ATCAAACACG CAAGAmATGT GAACAAATGT ATGATCATTT 2220 

CTTATATCTC AAACATAAAA TGAATTmyms TGAAGACGCT ATGATCAGGA TGACACGTAT 22 80 

TATAGAATCT TTCGATAGAG AAACGAATCA ACGTATCCGA CATCACGAAA TG 233 2 

(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATATGGT GGAAAGTCTT 60 

TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT TGGCATACGA 120 

ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTGAT TATCTT CTAA TGAATCAAAG 180 

TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT TTTACCGATA 24 0 

GATTGTGATT CATCGAAATC GATTGAGAAT TTAGAACTTA ATTGCTCAAA AATCTTAATC 3 00 

GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT GCTAATGCAG 360 

GATGGAAGTG TAAAACTGTA CGTGCATCTT TACTACCTTC AACGCCTTCT TCATCATATG 420 
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GAATATATTT TTCGTTCGTT TCTGGATCAT GGTATCTGAA ATCTTCACCA GAGTGTTCAG 54 0 

CATGTTTACG TAAGTCGAAG TCTGTACGAC TTGCGATACC CCATAACTCA CCCCAACCAA 600 

ATGGGAATTT ATATTCAATA TCAGTTGTTG CATTTGAGTA ATGAGATAAT TCATCTTCAT 66 0 

CATGATCACG TAAACGCATA TTTTCACTGC TCATATTTAA GCTTGTTAAC CAGTCACTTG 72 0 

CAAAAGTTTT CCAATAATTT TGCCATTCGA TTTCTTCTCC AGGTTTACAG AAGAATTCAA 78 0 

GTTCCATTTG TTcAAATTCT CTTGTTCTGa AAATGaAGTT ACCTGGAGTG aTTTCaTTAC 84 0 



GGaATGaTTT ACCAATTTGG ACCGG 865 
(2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 

25 TAATGAGTAA ATAAGTACCA GATAAAATCA TGAATATCAT CCAACATGCG GTTAACTCTA 6 0 

CT AAA T AA TT AATAATAGTA TTTTCAGTAA ACAAAGAACT ATGTATACTT CGCATCACAT 12 0 

TAGAATACGT ATGTTTCGCA TTTTGATCTG CAACAAATTG ATTGTTATGA TCTAGAAAGA 180 

30 CGTAACGTTG ATTTCCTGCC ATATCACTCA GTGTAATTCG TTTGTTATAT GGTTCATCAA 24 0 

GTATGCTAAC TTTACTTACA AAGAATCCTT CATATTGTTG TTCAACTTGA TGTACTGCAT 3 00 

CATTTAATGT TTGATGCGTT TTTACATCAC TGTCACCAAA AAACTCATTC TTATAAATAT 3 60 

TATTTTCAAC TTCTGGAAAG AACAAGTAAC CAATGCCCGA AATGGTTAAA GTGATTAACA 42 0 

GTGGAGCAAT AAATATTGCT GCATAGAAAT GTAATCTTTG TAATGGATTA AATGTATTTT 4 80 

T CAT ATTTC C CTCCCAATTG G CT ATT AT AC GGTGTCAATT CTGTGATGTG TGTGAACAAA 54 0 

CTGTGACAAT ATTTATTTTC TAGAAAAATT TAACGATGAT TTGTGATTTT TAGAAAAATG 60 0 

AACTTTTAAG TTGGAATGTT TGAAGAAAAT TG ATT ATT CG TATGTTTTAT CAAGCAGCTA 66 0 

TGATAAAATT TAAACATAAT ACAATGCGAG CCATTTAACG ATCTATGTTT AAATGGACAT 720 

CGATATTGTA TGAATTCGTT GTAACAAGCA AGCATTTCTA TGTGAACGAA CCAAAGGGGA 78 0 

AAGTAACATG ATTAATAAAG AACAATTAGA T C TTTTATAT AAATTAAAAA AAGAAGTTGA 84 0 

AAAGTCGCGA AATGAAGCAC TTTTACATAC AATTAACCAA GTAATTAAGA AAGTATATTT 900 

GCAGCAATAT ACATGTTCGT TCGTTGGACA TTTTTCTGCA GGTAAATCGA CACTGATAAA 96 0 
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TATTGTGTCA GTTTCAGACA ATCACGATAT TATTGCTAAT TTGCCGAATC AAACGTATGC 108 0 

CAAATTATCT AATTATGATG AAGTAAGGGA AATGAATCGC CAAAATGTCG ACGTTGAATC 114 0 

5 TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGTTGC AAGATACACC 12 0 0 

AGGTGTTGAT TCAAATGTTG CATCACATCA GTCAATAACA GAACAATATA TGTATACAAG 12 6 0 

TAATATGATA TTTTATACGG TTGACTATAA CCACGTTCAA TCTGAACTTA ACTTTAAGTT 13 2 0 

10 TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTGTTT ATCATTAATC AAATTGACAG 13 80 

CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCGAGTTGG AAAAATCAAT 14 4 0 

TGGC 1444 
(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6309 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 

GCAGCAGCTT ATCGTGAGTT ATCATTATTA TTACGTAGAC CTCCAGGTCG TGAAkCATAC 6 0 

CCAGGTGACG TATTCTACTT ACATAGTAGA t TATTAG AAA GAGCAGCAAA ATTAAACGAT 120 

30 GACTTAGGTG GCGGTTCAAT TACTGCATTA CCAATTATCG AAACACAAGC TGGTGATATT 180 

TCAGCTTATG TACCAACAAA CGTTATTTCA ATTACAGATG GACAAATCTT CTTACAATCT 24 0 

GATTTATTCT TCTCAGGTGT AAGACCAGCG ATTAATGCCG GACAATCTGT ATCTCGTGTT 3 00 

35 GGTGGATCTG CACAAATTAA AGCAATGAAG AAAGTTGCTG GTACGTTACG TCTTGACTTA 36 0 

GCGTCATACA GAGAACTTGA ATCATTTGCA CAATTCGGTT CAGACCTTGA TGAATTTACT 420 

GCAAGTAAAT TAGAACGTGG TAAACGTACT GTTGAAGTCT TAAAACAAGA TCAAAACAAA 4 80 

CCATTACCAG TCGAACACCA AGTGTTGATT ATTTATGCAT TAACAAAAGG ATATTTAGAT 540 

GATATTCCAG TTGTAGATAT CACACGTTTT GAAGACGAGT TAAACCACTG GGCAGAATCA 6 00 

AATGCTACTG AACTGTTAAA TGAAATCAGA GAAACTGGTG GCTTACCAGA TGCTGAGAAG 660 

TTTGACACAG CAATTAACGA ATTCAAAAAA AGCTTTAGCA AATCTGAATA ATAAACAAGT 72 0 

TTAGTATAAG GTGGTGAGAT AGTGGCTTCT CTTAAAGAAA TAGATACTCG AATAAAATCA 78 0 

50 ACCAAAAAAA TGAAGCAGAT TACGAAAGCG ATGAACATGG TATCAAGTTC AAAACTTCGT 84 0 

AGAGCTGAAA AAAATACAAA ACAATTCACA CCATATATGG ATAAAATGCA AG ATG CAATT 900 
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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ACTAGAAGTG GATATTTAGT TATCACGAGT GATAAAGGTT TAGCAGGTGC ATATAGTGCA 1020 

AACGTGCTTA AAAAATTGAT TACTGATATT GAAGCGAAAC ATCAAGATAG TAGCGAATAC 1080 

AGTATTGTAG TTTTAGGGCA ACAAGGTGTT GATTTCCTAA AAAATAGAGG TTATGACATT 1140 

GAGTATTCTC AAGTAGACGT ACCTGATCAA CCTTCTTTCA AATCTGTTCA AGCACTAGCT 1200 

AACCATGCTA TAGACTTATA CAGTGAAGAA GAAATTGATG AATTAAATAT ATACTATAGT 1260 

CATTATGTCA GCGTTCTTGA AAACAAGCCT ACATCTAGAC AAGTATTACC ATTATCTCAA 1320 

GAGGATTCTA GTAAGGGGCA TGGTCATTTG TCTTCTTATG AATTTGAGCC AGATAAAGAA 1380 

TCTATCTTAA GTGTAATCTT GCCTCAATAT GTTGAGAGTT TGATTTACGG AACAATATTA 1440 

GACGCAAAAG CAAGTGAGCA TGCAACACGT ATGACTGCGA TGAAAAATGC CACTGATAAT 1500 

GCAACTGAAC TTATTGATGA CTTATCATTA GAATATAACA GAGCGAGACA AGCAGAAATT 1560 

ACGCAACAAA TTACTGAAAT TGTTGGTGGT TCCGCAGCGC TTGAATAATA TTTAAAGGAG 1620 

GAAAATAGCA TGGGAATTGG CCGTGTAACT CAAGTTATGG GTCCTGTAAT TGATGTTCGA 1680 

TTTGAACATA ACGAAgTTCC TAAAATTAAT AACGCCTTGG TTATTGATGT GCCTAAAGAA 1740 

GAAGGTACAA TACAACTAAC ATTAGAAGTT GCGCTGCAAT TAGGTGACGA CGTTGTTCGT 1800 

ACAATTGCGA TGGATTCAAC TGATGGTGTC CAAAGAGGCA TGGATGTAAA AGATACAGGC 1860 

AAAGAAATTA GTGTACCTGT TGGTGACGAA ACATTAGGTC GTGTATTTAA TGTACTAGGT 1920 

GAAACAA7TG ACCTTAAAGA AGAAATTAGT GATTCTGTTC GCCGCGATCC TATCCATCGT 1980 

CAAGCACCAG CATTCGATGA ACTTTCAACA GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 2040 

GTAGTAGATT TACTAGCACC TTATATTAAA GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

GGTGTAGGTA AAACAGTATT AATCCAAGAA TTAATTAACA ACATCGCTCA AGAGCACGGT 2160 

GGTATTTCTG TATTCGCCGG TGTAGGTGAA CGTACTCGTG AAGGTAACGA TTTATACTTC 2220 

GAAATGAGTG ACAGTGGTGT AATTAAGAAA ACAGCCATGG TATTCGGGCA AATGAATGAG 2280 

CCACCTGGTG CACGTATGCG TGTTGCATTA TCTGGTTTAA CAATGGCTGA ATATTTCCGT 2340 

GACGAACAAG GTCAAGACGT ATTATTATTC ATCGATAACA TTTTCAGATT TACACAAGCT 2400 

GGTTCTGAGG TATCTGCATT ATTAGGTCGT ATGCCTTCTG CAGTAGGTTA CCAACCAACA 2460 

CTTGCTACTG AAATGGGACA ATTACAAGAA CGTATTACGT CTACAACAAA AGGATCAGTT 2520 

ACTTCTATTC AAGCGGTATT CGTACCTGCC GATGACTATA CTGACCCAGC GCCTGCGACA 2580 

GCGTTTGCCC ATTTAGATGC AACTACAAAC TTAGAACGTA AATTAACTGA AATGGGTATT 2640 

TATCCAGCCG TGGATCCATT AGCGTCTACA TCAAGAGCAT TGGAACCATC AATTGTAGGT 2700 
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CAAGATATCA TTGCTATCTT AGGTATGGAC GAATTATCTG ATGAAGATAA ACAAACAGTT 2820 

GAACGCGCAC GTAGAATTCA ATTCTTCTTA TCTCAAAACT TCCACGTAGC GGAACAATTT 2 8 80 

ACTGGTCAAA AAGGTTCTTA TGTACCTGTT AAGACAACAG TTGCAAACTT TAAAGATATC 294 0 

TTAGATGGTA AATATGACCA TATTCCAGAA GATGCATTCC GTTTAGTTGG TAGCATGGAT 30 00 

GATGTTATTG CAAAAGCTAA AGATATGGGT GTTGAAGTAT AACAATTAGG AGGAATGGAT 3060 

AATGAATACA TTAAACCTAG ATATTGTCAC TCCTAATGGT TCTGTTTACA ATCGTGATAA 3120 

TGTTGAACTC GTTGTTATGC AAACAACAGC TGGTGAGATA GGTGTCATGA GTGGACATAT 3180 

TCCAACTGTA GCTGCTTTAA AAACAGGCTT TGTAAAAGTG AAATTTCACG ATGGAACTGA 3240 

ATATATTGCT GTAAGCGATG GCTTTGTTGA AGTTAGAAAA GATAAAGTTT CAATCATTGT 33 00 

TCAGACTGCA GAAACTGCAA GAGAAATTGA TGTTGAAAGA GCTAAATTAG CCAAAGCAAG 33 60 

AGCAGAGTCT CACTTGGAAA ATGATGACGA CAATACTGAT ATTCATAGAG CCGAAAGAGC 34 20 

TTTAGAGAGA GCAAATAACC GTTTGCGTGT GGCTGAATTA AAATAGTAAA TAAAGGGTCG 3480 

AAGATGTGAT TTCATATCTT CGACCCTTTT TTGAATTATA TTGATTTAAA GATACAAAAC 3 54 0 

ATGAGAGGGG GGAAGGAATT GATAAAGAAC CATTAAAGAT TTATGATGTA GTGGTTCTTT 3 600 

ATCATTAAAC ACAGCTAATG TGTATTTAAA AATAGGaayA CATgAGTAAA ACTCATGTAT 3 6 60 

AAGAAATACT AATTTCTAAA G AAAAAG TAT TTCTTTATGT TGGGGCCCCG TCAACTACTG 3 72 0 

CCAAATACAA CACTATAGAG TCTAGACATT GATTTATGTC CGACTCCCAA GAATAGTTTT 3 780 

ACTTTTTTAC AATCACTAAT AGATTGCTAA AATCAAAATT TCCTTCACCA CTATCTACAG 3 84 0 

TCGACATTTC ATTTTTTGAA ATTATCTACA TTTTTT CAT A CCAAGATATT TTATAGTTAT 3 900 

GATATTTATG TAAAAAGAAT TATATAGTAA GTTAGCTTAA ACTTTACTAA AAACGGGTAT 3 960 

TAAACTTTGT ATCATTATTT AAATTTTTCA TGTACAATGT AATACAGTAA TCTTATGAGG 4020 

TGATAAAATG G ATTAT AT CG GACAATATGC AGTTATCCAT TTAGTGTTAC ATGTTGTATG 4 080 

TATTTGTATT GCCTATTGGG CTTTACAATC AATTAGATTA GATCAATTTT TTAAAAAAGG 4140 

ATACGCCACT CAATTACAAG TGTGTATGAT ATTTGTTGCT ATTTTATTAG GCACTGCAGT 4 200 

AAGCAATTTT ATTGTAGATT TGTTACAATA CTCGACGCAG GTAAAATATT TAATAAAATA 4 2 60 

AGTCTAACTC TATGATTTGT AATCAAAACT AGATATAATT AAATAATGAC TTAAAATAAT 4320 

TTTAAAATAG GGAAATGTAA AGTAATAGGA GTTCTAAGTG GAGGATTTAC GATGGATAAA 43 8 0 

ATAGTAATCA AAGGTGGAAA TAAATTAACG GGTGAAGTTA AAGTAGAAGG TGCTAAAAAT 444 0 

GCAGTATTAC CAATATTGAC AGCATCTTTA TTAGCTTCTG ATAAACCGAG CAAATTAGTT 450 0 
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GACGTTACAT ACAAAAAGGA CGAAAATGCT GTTGTCGTTG ATGCAACAAA GACTCTAAAT 4 62 0 

GAAGAGGCAC CATATGAATA TGTTAGTAAA ATGCGTGCAA GTATTTTAGT TATGGGmCCT 4 68 0 

CTTTTAGCAA GACTAGGACA TGCTATTGTT GCATTGCCTG GTGGTTGTGC AATTGGAAGT 4 74 0 

AGACCGATTG AGCAACACAT TAAAGGTTTT GAAGCTTTAG GCGCAGAAAT TCATCTTGAA 4 800 

AATGGTAATA TTTATGCTAA TGCTAAAGAT GGATTAAAAG GTACATCAAT TCATTTAGAT 4 86 0 

TTTCCAAGTG TAGGAGCAAC ACAAAATATT ATTATGGCAG CATCATTAGC TAAGGGTAAG 4 92 0 

ACTTTAATTG AAAATGCAGC TAAAGAACCT GAAATTGTCG ATTTAGCAAA CTACATTAAT 4 980 

GAAATGGGTG GTAGAATTAC TGGTGCTGGT ACAGACACAA TTACAATCAA TGGTGTAGAA 5 04 0 

TCATTACATG GTGTAGAACA TGCTATCATT CCAGATAGAA TTGAAGCAGG C AC ATT ACT A 5100 

ATCGCTGGTG CTATAACGCG TGGTGATATT TTTGTACGTG GTGCAATCAA AGAACATATG 5160 

20 GCGAGTTTAG TCTATAAACT AGAAGAAATG GGCGTTGAAT TGGACTATCA AGAAGATGGT 5220 

ATTCGTGTAC GTGCTGAAGG GGAATTACAA CCTGTAGACA TCAAAACTCT ACCACATCCT 52 80 

GGATTCCCGA CTGATATGCA ATCACAAATG ATGGCATTGT TATTAACGGC AAATGGTCAT 534 0 

25 AAAGTCGTAA CCGAAACTGT TTTTGAAAAC CGTTTTATGC ATGTTGCAGA GTTCAAACGT 54 00 

ATGAATGCTA AT AT CAATGT AGAAGGTCGT aGTGCTAAAC TTGAAGGTAA AAGTCAATTG 54 60 

CAAGGTGCAC AAGTTAAAGC GACTGATTTA AGaGCAGCAG CCGCCTTAAT TTTAGCTGGA 5520 

TTAGTTGCTG ATGGTAAnAC AAGCGTTACT GAATTAACGC ACCTAGATAG AGGCTATGTT 55 80 

GACTTACACG GTAAATTGAA GCAATTAGGT GCAGACATTG AACGTATTAA CGATTAATTC 564 0 

AGTAAATTAA TATAATGGAG GATTTCAACC ATGGAAACAA TTTTTGATTA TAACCAAATT 5700 

AAACAAATTA TACCTCACAG ACAGCCATTT TTATTAATTG ATAAAGTAGT TGAATATGAA 5760 

GAAGGTCAAC GTTGTGTGGC TATTAAACAA GTATCAGGAA ACGAACCATT CTTTCAAGGG 58 20 

CATTTTCCTG AGTATGcGGT AATGCCAGGC GTATTAATTA CTGAAGCGTT AcTCAAACAG 58 8 0 

GTGCGGTAGC TATTTTAAAT AGTGAAGAAA ATAAAGGTAA AATCGCTTTA TTTGCTGGTA 594 0 

TTGATAAATG TCGTTTTAAA CGTCAAGTAG TACCTGGTGA TACTTTAACG TTGGAAGTAG 60 0 0 

45 AAATCACTAA AATTAAAGGA CCAATAGGTA AAGGTAATGC TAAAGCTACT GTCGATGGTC 60 6 0 

AACTTGCTTG TAGTTGTGAA CTTACATTTG CAATTCAAGA TGTAAAATAA AACAAAAAAA 612 0 

ACATTCAAAG ATTTAATGTG TTGGCATAAT CTTTGAATGT TTTTTATTTT ACTCTTCTAA 6180 

TTTTTCATCC TTTAACTTTG GTTTAGACTG CaTCATTCGA TTAAATGATT TTTTTAATTC 6240 

TTCACCAGAT AATCCATCAT CAATAAGTTG GTTCTAATAA ACTTTCAGCA TACTGTTGGA 63 00 
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(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 

AGTTGCTACA CCAGACATGA TGGGTGAAGT TGGTAAATTA GGTCGTGTAT TAGGACCAAA 60 

AGGTTTAATG CCAAACCCTA AAACTGGAAC TGTAACAATG GATGTTAAAA AAGCTGTTGA 120 

AGAAATCAAA GCTGGTAAAG TAGAATATCG TGCTGAAAAA GCTGGTATCG TACATGCATC 180 

AATTGGTAAA GTTTCATTTA CTGATGAACA ATTAATtGAA AACyTCaATA CyTTACAAGA 24 0 

20 TGTATTAGCT AAAGCTAAAC CATCATCTGC TAAAGGTACA TACTTCAAAT CTGTTGCTGT 3 00 

AACTACAACA ATGGGTCCTG GAGTTAAAAT TGATACTGCA AGTTTCAAAT AATAAATGAT 360 

ATAAACAATT ACAGGCTGAA AGAAATATCT TTCAGTCTGT AAAAATATAT TGACAATAAG 420 

TAATTTCCAA GTTATATTAC TTATTGTGAT TATTTTACCT AAGACAGTAG GAGTTATTTA 480 

TAACTTAAAA TTTATCCTGC CGAGGCTAAA ATTGACTTGA ACGTGATGAT CTATGATCTT 54 0 

TCAAGCACTT TTTGCCGTGG GTAGAAAGTG CTTTTTTTAT TAATTTTAAA AAAAGCACCA 600 

30 AAAATTTAAA TGGAGGTGTC TGAATGTCTG CTATCATTGA AGCTAAAAAA CAACTAGTTG 660 

ATGAAATTGC TGAGGTACTA TCAAATTCAG TTTCAACAGT AATCGTTGAC TACCGTGGAT 720 

TAACAGTAGC TGAAGTTACT GACTTACGTT CACAATTACG TGAAG CTGGT GTTGAGTATA 78 0 

AAGTATACAA AAACACTATG GTACGTCGTG CAGCTGAAAA AGCTGGTATC GAAGGCTTAG 84 0 

ATGAATTCTT AACAGGTCCT ACTGCTATTG CAACTTCAAG TGAAGATGCT GTAGCTGCAG 900 

CGAAAGTAAT TTCTGGATTT GCTAAAGATC ATGAAGCATT AGAAATTAAA TCAGGCGTTA 96 0 

TGGAAGGCAA TGTTATTACA GCAGAAGAAG TTAAAACTGT TGGTTCATTA CCTTCACACG 102 0 

ATGGTCTTGT ATCTATGCTT TTATCAGTAT TACAAGCTCC TGTACGCAAC TTCGCTTATG 1080 

45 CGGTTAAAGC TATTGGAGAA CAAAAAGAAG AAAACGCTGA ATAATTTTTA GCGTAAAAAA 114 0 

ATTAAAAATA ATGGAGGAAT TATAAAATGG CTAATCATGA ACAAATCATT GAAGCGATTA 1200 

AAGAAATGTC AG TATTAGAA TTAAACGACT TAG T AAAAG C AATTGAAGAA GAATTTGGTG 126 0 

50 TAcTGcAGCT GCTCCAGTAG CAGTAGCAGG TGCAGCTGGT GGCGCTGACG CTGCAGCAGA 1320 

AAAAACTGAA TTTGACGTTG AGTTAACTTC AGCTGGTTCA TCTAAAATCA AAGTTGTTAA 1380 
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 1500 

AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 1560 

AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 1620 

TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 1680 

AGTTACAAAG TGAAGATGAG GTGGGAATAA TGAGTCA 1717 
(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 

GTAAATCTGT TACTCGAAAT GTAACGATTA AAGAAAAGGG CTCATCTCAA ACATATATTT 6 0 

TGTTAGGCTA TCCAACAAAA GCACAGAAGA ATAGTCATAG CAAATATAGT GGAGTCTTTA 12 0 

TATATAAAGA CTTGAAATCA ATCGAAGATA CAAATAATGC TATTACGATT ATCACCATAA 180 

TTACGGCTGT TATTTTCTTA ACAATTACAA CAGTCTTTGC GTTTTTCTTA TCGTCAAGAA 24 0 

TTACAAAACC TTTAAGACGT TTAAGAGACC AAGCTACACG TGTATCTGAA GGGGATTACT 3 00 

CTTATAAACC TTCTGTCACA ACGAAAGATG AAATTGGTCA ATTATCGCAG GCATTTAATC 360 

AGATGAGTAC AGAAATCGAA GAGCATGTCG ACGCATTATC CACATCTAAA AATATTAGAG 42 0 

ACAGCTTAAT TAACTCTATG GTAGAAGGTG TCCTAGGTAT TAATGAGAGT CGACaAATTA 480 

TCTTATCTAA TAAGATGGCG AATGATATTA TGG A CAAT AT TGATGAAGAT GCTAAAGCTT 54 0 

TCTTATTAAG ACAAATAGAA GATACTTTTA AATCAAAACA AACTGAAATG CG CGATTT AG 6 00 

AAaTGAATGC ACGATTCTTT GTTGTGACCA CAAGCTATAT CGACAAGATT GAACAGGGAG 660 

GTAaAAGTGG TGTTGTTGTG ACAGTTCGTG ATATGACTAA TGAGCACAAT CTAGATCAAA 72 0 

TGAAGAAAGA TTTCATTGCT AATGTATCAC ATGAATTACG TACACCGATA TCATTACTTC 78 0 

AAGGTTATAC TGAATCAATT GTAGATGGTA TTGTTACAGA ACCGGATGAA ATAAAAGAAT 84 0 

CGCTTGCCAT TGTCCTTGAT GAATCGAAAC GTTTAAATCG TTTAGTTAAT GAATTGTTAA 900 

ATGTCGCACG CATGGATGCT GAAGGGTTAT CCGTAAATAA AGAAG TTCAG CCTATTGCAG 960 

CGTTACTAGA TAAGATGAAA ATTAAGTATC GCCAACAAGC TGATGATTTA GGTCTAAATA 1020 

TGACTTTTAA TTAyTGTAAG AAGCGTGTTT GGAGTTATGA TAwGGATCGC ATGGACCAAG 10 8 0 
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TTACTTGTGA TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 1200 

TTGCACCAGA A CAT TT A OA CAAGTATTTg ATCGTTTTTA TAAAGTTGAT GCAGCGnAnA 126 0 

ACCCCGnGGT AACCAnGTA 1279 
(2) INFORMATION FOR SEQ ID NO: 483: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 

GAGCTGTTGT TACTTTGATG CCTGCAGCTT TATTACGGCT GACTTGGTAA TGATAAGTTT 6 0 

CAGCATATTG CTCAATATAT GOT AT AT CAT ATTGAATGGT ACGAGGTGAT ACACCAAGTT 12 0 

GATTAGCAAT GGTATTGATT GGAATAAACG TTTGCTCATG AATTAAAAGA TACAAAATTT 180 

CGATTTGTCT ATAACTTAAC AACGTAATAT CCTCCTATTT GTAATTGTAA GCGATTTCTT 24 0 

AAAAACGTAG ATATGCAATC T CTTTCAT AT TTTAATCCGA AAAATTGCAT ATCAAAATGT 3 00 

TTATGGCGCA AGATTTTATA GGAACTTTTA AAATAAATTA rATATTCATG TTGACAATTT 360 

AAAAATGTCG CAGTATATTT AGTTAGACAT CTAACGAAAT GGTGGTGCAA TAAATGGAAT 4 20 

TCACTTATTC GTATTTATTT AGAATGATTA GTCATGAGAT GAAACAAAAG GCTGATCAAA 480 

AG IT AG AG C A ATTTGATATT ACAAATGAGC AAGGTCATAC GTTAGGTTAT CTTTATGCAC 54 0 

ATCAACAAGA TGGACTGACA CAAAATGATa TTGcTAAAGC ATTACAACGA ACAGGTCCAA 600 

CTGTCAGTAA TTTATTAAGG AACCTTGAAC GTAAAAAGCT GATCTATCGC TATGTCGATG 66 0 

CACAAGATAC GAGAAGAAAG AATATAGGGC TGACTACCTC TGGGATTAAA CTCGTAGAAG 72 0 

CATTCACTTC GATATTTGAT GAAATGGAAC AAACACTCGT ATCGCAGTTA TCTGAAGAAG 780 

AAAATGAACA AATGAAAGCA AACTTAACTA AAATGTTATC TAGTTTACAA TAAATGATAA 84 0 

GTGTGACTGG T AG AAAT C AG TCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTGTT 90 0 

AGTAAGCTAA TTATTGGAAA AGACAAGGAG TATTGAACAA TGAAAGACGA ACAATTATAT 960 

TATTTTGAGA AATCGCCAGT ATTTAAAGCG ATGATGCATT TCTCATTGCC AATGATGATA 1020 

GGGACTTTAT TAAGCGTTAT TTATGGCATA TTAAATATTT ACTTTATAGG ATTtTyAGAm 1080 

50 GAyAGCCACA TGATTTCTGC tAatCTCTCT AACACTGCCA GTATTTGCTA TCTTAATGGG 1140 

GTTA 1144 
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w 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 

ACAACCAATT TTACTAAACA TGGTrrTCAT gCATTTTCAA TATTTAGATG TCGACCGCAT 6 0 

AATCGAAGAA TCGCCGACAA TAGTACTTAT CGATGAGTTA GCACATACGA ATATTTCTAG 12 0 

AGATCGTCAT GAGAAACGAT ATATGGATAT TGAAGAAATT TTAAATCATG GTATCGATGT 180 

TCATACCACT TTGaACATTC aaCATATTGA AAGTTTAAGT AGTCAAATTG AACTGATGAC 24 0 

CGGTGTACAT GTTAAAGAAC GTGTACCCGA CTATTTCATA ATGAGCGCCG ATGTATTAGA 30 0 

20 AGTCGTAGAT ATCTCACCTG AACAATTAAT TAAACGCTTA AAAGCTGGCA AGGTATATaA 3 60 

AAAGGATAGG CTAGATGTAG CATTTAGTAA TTTCTTTACG TATGCCCACC TAAGCGAAtG 42 0 

CGTACATTGA CGTTAAGAAC AGTTGCCGAC TTGATGAGTG ATAAAGAAAA AGTCCGACAC 4 80 

AACCATAAAA CGTCACTCAA ACCTCATATT GCTGTGGCAA TTAGTGGGAG CATTTATAAT 54 0 

GAAGCAGTAA TTAAAGAGGC ATTCCATATT GCTCAAAAAG AACATGCGAa GTTCACTGCT 600 

ATTTATATAG ATGTATTCGA AAAAAACAGG CAATATAAAG ATAGTCAAAA GCAAGTGCAT 6 60 

CAACATCT C A TGCTTGCAAA ATCATTAGGA GCAAAAGTAA AAGTAGTTTA TAGCCAAACC 72 0 

GTTGCATTAG GATTAGACGA ATGGTGTAAA AATCAAGATG TAACCAAATT AATTATCGGA 78 0 

CAACATATTA GAAATAAGTG GCGAGACTTT TTCAATACAC CTTTAATTGA CCATTTAATG 84 0 

TCCTTTGAAC ATAG C TAT AA AATCGAAATC GTTCCAATCA AACAAATACC TGTTGAATTG 900 

AAAATGAACA AATCACCCTA TCGTCCTAAA GGCAAACGTT TCGCCATAGA TATGTTAAAA 96 0 

ATGATTTTGA TTCAAATAAT TTGTGTAATG ATGGGACTGT GGATTTATCA ACTTGATAAG 102 0 

CATGAGTCTA GTACGATTAT TTTAATGATT TTTCTCATCG GCATCATTTT ATTATCCATT 1080 

TGGACGCGGT CCTTCATCAT TGGCTTTTaG CAGCAATTAt TAACGTATTT GTgTkTAATT 114 0 

ATkTTTTtAC GGAACCTA 1158 
(2) INFORMATION FOR SEQ ID NO: 485: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 485: 

ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATGA CGTTTAAACA 60 

TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTATTT GCGATGATGT GGGAATAGTT 120

ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 180 

ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 24 0 

TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAGCAA 300

TACATTAATG GCGAATGGGT AGAAAGTAAT AGTAATGAAA CGATAGAAGT TATAAATCCA 360 

GCAACCGAAG AAGTAATCGG GAAAGTTGCT AAAGGTAATA AAGCTGATGT TGATAAAGCC 4 20 

GTCGAGGCGG CAGACGATGT TTATTTAGAG TTCCGTCATA CATCTGTGAA AGAAAGACAA 480

GCGTTATTAG ATAAAATTGT AAAAGAmTAT GAAAACAGAA AAGACGATAT TGTACAAGCT 54 0 

ATTACGGATG AATTAGGTGC TCCTTTATCA TTATCTGAGC GTGTCCATTA TCAAATGGGA 6 00 

CTAAACCATT TTGTTGCAGC GAGAGACGCA TTAGATAACT ACGAATTTGA AGAACGCCGC 6 60 

GGAGATGATT TAGTTGTTAA AGAAGCAATC GGTGTATCTG GATTAATTAC ACCGTGGAAC 720 

TTCCCTACAA ACCAAACATC ATTAAAATTA GCAGCAGCAT TTGCGGCTGG TAGTCCAGTT 780

GTACTTAAAC CATCTGAAGA AACACCATTT GCAGCTGTTA TTTTAGCTGA GATTTTTGAT 84 0 

AAAGTCGGTG TTCCTAAAGG TGTATTTAAC CTTGTTAATG GTGATGGTGC TGGTGTTGGG 90 0 

AATCCTTTAT CTGAACATCC TAAAGTACGC ATGATGTCAT TTACAGGATC AGGCCCTACT 960

GGTTCTAAAA TTATGGAAAA AGCCGCTAAA GATTTTAAAA AGGTATCATT AGAGCTTGGT 1020

GGCAAATCAC CATATATCGT CCTAGATGAC GTAGATATTA AAGAAGCGGc TAAAGCAACa 108 0 

aCAGGCAAAG TTGTTAATAA TACTGGTCAA GTATGTACAG CTGGTACACG TGTTTTAGTG 1140

CCTAACAAAA TTAAAGATGC ATTCTTAGCT GAATTAAAAG AACAATTTAG CCAAGTGCGT 1200 

GTCGGTAATC CAAGAGAAGA TGGTACACAA GTAGGCCCTA TCATTAGTAA AAAACAATTT 126 0 

GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTGAATT ATTTTATGGT 13 2 0 

GGTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CACGTCCGAC AATTTTTATT 1380 

AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 144 0 

ATCACTTATA ACGATTTAGA TGAAGCGATT CAAATTGCAA ATGATACAAA ATATGGTTTG 1500 

GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 1560 

GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 1620

AAACAATCTG GTTTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 1680 
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AGTGCACATG ACTAATTAAG TTTTGTGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 1800 

TTTTGTAATT AAAATAAAAA TTTGCTATAG TTATTCATGT ATTTAAAAGG TTGGGGATTA 1860

GCATAATGGG ATTGTGCTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAGAACG TACATCTCCT 198 0 

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT CACTCAnAAA 204 0 

GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT TGTTCTAGCT 2100

ATTATGGAAG CACAAATGGA nAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 216 0 

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 222 0 

ATTT 2224 

(2) INFORMATION FOR SEQ ID NO: 486:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1690 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486:

ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTTCTT TTTCAAATTT 6 0 

CTTAATTAAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 120

ATGCACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 180 

TAATGCACCT ATAATGAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 24 0 

TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300

GCAAACAATA ATGTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 36 0 

TAAACTTCAA CTGATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 420 

TCGTCTAGTG AATAAGTTAA AGCCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 480

GGAATAATAA TGTTGCTTAA TAATTGTGGT TCAGTCGCTC CTAAATCTCT TGCAGCATTT 54 0 

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 600

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 660

CCAATCGTTG TAAACATAAT TAAGAATGAT GCACCTATGA CAACGTCGGA TGATACCATC 72 0 

AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 7 80 

GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 84 0 

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AATGTAAAAT GTTCAAAGTG AATCATATTA CCAGCCGAAT TGAATGAATA GAACATTAAA 960 

AAGAATATTG GGATGTATAA AATCGCTAAA AGTATCCCGA TATACAGCTT TCCATACCAT 1020 

TTCATATGAT TCACCCTCTC CCATTAGATG ATTTTGTAAT GATTAAAATG AATGCCATAA 1080

ATACAATTAA GAATATAGCT ATAGTTGATC CCATACCATA ATTTTGAATT GTTAAAAATT 1140 

GTTCCTCTAT TGCCGTACCT ATATTTATGA CTTTATTACC TGCAATTAAT CTTGTAATCA 1200 

TAAATAATGA AAGTGATGGA ATAAAGGTTA CTTGAATCCC AGTCATAACA CCTTCTTTTG 1260 

TTAACGGCAT GATTACTTTT CTAAAAGTAT AGAAAGGACT GGCACCTAAA TCACTTGAGG 13 20 

CCTGCAATAA ATTATTAGGA ATTGCTTTCA TGCTATTAAA TATAGGTAAA ATCATAAATG 13 80 

GTATATAAAT GTAACTTGCC ACTACTAAAA ACGCACCAGT TGTAAATAAC AAATTGAATG 144 0 

ATGGTAAATT AAATAAGTGG AAAATTGATT AATCACGCCA TCATGACTTA ATAAACCTAT 15 00 

AAAAGCATAT GTCTTTAACA ATAAATTTAT CCATGTTGGA ATAATCATTA TCATTAATAA 1560

GATATTTTGA AATTTCGAAC GAGTAATATA ATAGGCAGnT GGATAACTGA TAGTCAAGGT 1620 

AATAATTGTT ATTGAAGCGG CATATAAAAT TGAATATGCA AACATTTTCA AATATTTTGT 16 80 

AGTAAAAATT 1690

(2) INFORMATION FOR SEQ ID NO: 487:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2112 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487:

ACGAAAAGAA ATATTATGAT GAACAAAAAG AAAGAATAAC GATTTATATG AAGTACAATG 6 0 

TGAAAGGTTA TAAAAATATA AGCTTCGCTA ATTTTAAAGA AAACCCAATG GATGGTTATT 120 

CTATTAGTGG TTATATAAAT AAtGaTAAAA AGTTATCATT TACAGCTGGT ATAAGATCTG 180 

TTGATGATTT TCAATTTGAT ACCGATATTT CTTATACAGA TGAATTGGGT AGAAAATTTA 24 0 

ATAAAAATCC TAAGTCAGTT TCTGAAATAA AAAAAGAGCA AAATACGTCC AATAAATAAT 3 00 

TGTTCATATT GTGATGAAAC AAAAATATAA GTCATTAGAT GAGTTTAACT ATGTTATAAA 360 

TATTTGTAGT ATCTATAAAA ATCTCGACAC TATTAAAATG ATAAAGTGCC GAGGTTTTCT 420 

TACTTATTTA GTTAATTCAA AGTTTATGCC AGATTCATAA GAATTTGTGA CACTTTTAAT 480

AGTGTAcCAT TGATTATTAC AATTTATCAA ATGGTCCTTT AGAAGGTATA AATAACAAAA 54 0 
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TATTATTATG TTCAAAACTT TACGCTCCAA AAAGTAAAAA GGAAGTTAAG CAATGTTTAG 66 0 

TTGCTTAaCT TCGGaTATTG AACGCATCAG TCCAATTTGA CATAGAGCCT TTTTTAGTTC 720

TTGATGTTTC TCTTTAAAAC CTTGCATATT TTACAAAAAG AAAAATTAGC AGTATAATTA 780 

AGACAACGAA AATAAGTATT TACTTATACA CCAATCCCCT CACTATTTGC GGTAGTGAGG 84 0 

GGATTTTTAT TGGTGCGGCT ATATGTCACC TATTTTGTAT TGCGTCTACT TAGCCAATAA 900 

GAAAAAAACG CAATGGCACA GCCACTGATG ACTGGTGCTA TGATGTGAAC GaAAATAAGC 96 0 

ATCACCTTAT ACACCTCCTC TCTGCGTCTA AATTGACGsC TGAGaGrTAG GcGACTCTAC 102 0 

TATTATATCA TCGGCAAATA TACAAGCACA GTCACTTGCT TCTGATAAGT TATATGATTC 1080 

TAGCTGATAG ATTGAATCGT CTACACTTAA TTGGACAAAT TCTATGAGAA TAGATATTGT 114 0 

TAATTTAAGA AAGTAGGCGA TTTTATTATG ACAAGAGAAA GAAGATCATT TAGTTCAGAG 12 0 0 

TTTAAGTTAC AAATGGTTAG ATTATATAAA AATGGTAAGC CTAGGAATGA AATTATACGC 126 0 

GAGTATGATT TCACACCTTC GACGTTTGTA AATGGCGGTT ATAAAATGTA GGAAAATGGA 1320

TAAAGCAACA TCAAAACACG GGTACATTCA ATCACCAAGA TAACTTATCG GATGAAGAAA 13 80 

AAGAGCTGAT TAAATTACGC AAAGAAGTTC AACATTTAAA AATGGAGAAC GATATTTTAA 1440

AGCAAGTAGC GCTGATTATG GGGCAAAAAT AGAAGTCATT CAAAAGAATG CACATCAATA 150 0 

TTCAGTATCA GCAATGTGTA AAGTCCTGAT AATACTAAGA AGTACCTATT ATGATTCTAT 156 0 

AAAAAGAAAA GATAATAAAA TCACTAAAGA TGATTCAAAC ATAGAACATG CCGTCATAAA 1620

TATTTTTAAT TCTAATAGAA AAGTCTTTGG TACAAGACGA ATTAAAAATC ATTTAAATGA 1680 

CAAGGGTCTC ACTGTATCTG GACAAAAGAT AGGTCGATCA TGAAAAAATC TAGTTTCTGT 174 0 

TTATACGAAA TCTAAATACA AAAATCATCT AAAAGAAACT AATGAAAAAC GAATTAAAAA 1800

TCTTTATTAT TAGCTGCTGG TGTATTATTT GTTAGTCCAA TTTCATTATC TTATAATTCA 1860 

GATGTAGCTC ATGCTGAAGA TAAGTTAGAC CATTCTCAAG CAAAGGTAAT ATATTTGAGT 192 0 

AACCAAAATT TATTTGATGA ACTTGAGAAA AAAGGTTATA AACTGGAAGA TATATTTACA 198 0 

AAAGAAGAAA TAAAAAAATA TAAAGCTGAA GACCAATTGA GAGCGGGTAA AACTGAATAT 204 0 

GTAGAAACAG GTAAAGATAC TGCAACATTA TATCTTTCTT CTGCATATAC AAAAACAATA 2100 

GCTGCTTTAG GT 2112 
(2) INFORMATION FOR SEQ ID NO: 488:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

GTAGGCACAC ATCTGCCATA TAAACATTCT TTTATACTAG TGTTCTCATA TAGTGTAGAG 60

TTATAGTCTC CTTCTTGAAT CTCGAATAAT TCAATCAACC TATCAACCTT AGTCTCTTCC 12 0 

GTTACTTCTT TTTCAATATC AACTATGAAG GGGATATCAA TTGGAATAAA ACTTGACGTC 180 

GAACACTTAT TTGTATTTGG ATGAAAACGA ACGAATCCAT CACTAAATCC TGTTGAAAAA 240

AATATTTTTC CTTGTGATAG ATCCGGATTT TCTCGCGCCC ATTTAATTAA TTCATCTAAT 300 

CTCATTTCTT TTTTAACTTT GATTTTCATT GTTATATCTC CTCTTGAACA GTAAATTTAT 36 0 

CGTTAAtTGA TACGTATCCA GTCACATTAC ATAAGATGCT ATCAACATCA AAAGTCACAC 420

AACAGTTGCG TTCAACATCA TTTGAATAGA ATCT 454



(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1372 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 

TTGTCAGAAT TAGAATGCTT TTGAGTTACT TCATAATACT CATCAGTTTT TTGTGTATCC 60

TTTTGACTTT TATTTATTTC TTTCCACTTA CCAGTATGAC TTTCTTTTTT TACAGTTATT 120 

TTCGGTTTGT TTAAAAAATG ATAACCGATG TTCTTTTTGG TATCTGTGGA CTCTAAAAAG 18 0 

ACTGAATTGT TTTCCTGATT ATCAGAATTG GTTGTTTGTT TATCGTCTGT ATATAACGTA 24 0 

TACCCATTCG CTTTGCGATT TTCAATCGTT GTAACTGGAT TCTTTGATGA ATCAGTACTT 300 

TTAATGCCAG TTCCTAAGAA GACAATTTTA TCGTTTAATA TGAAATATGA TTTTTTGGCA 36 0 

GTTAAAGTTT TGTCCTGATT TTCAAAATCC ATTCCGATAC TAGCATGTTG GTCATCAACT 420

TTTGTTCCGC CAACAAAAGT TTTACTCGAC TTTTTATCAT CCGTATCTTT TAATATTTCA 4 80 

TTGTCTAAAG TTGTTGTACC TGATAAACGT TTCATATCGG CTGTCACCCA GAAGTTATCA 540

TGATAGTGTT TGACATCGCT GTTATATAAA TAAGACATTC CAGCACCAGT GTGCCAACCT 600

TTTAAATTCT CTCCGTTGAT ACTTTCATAG CGTGCTACGT TTTTCGACGT CATACTTAAA 6 60 

CCAAATGCAA AGTCTAAGTC TTTGTTATGA TAGGTGACAC GATCCATGTC ATTATATATT 720 

TTAAGTTGTT GTGTTAATCC GTTTTTAGAA ATACTGTTAT CTGTCATTAA AGACTTCATT 780
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ACTGAAGATT TGACAATCTT TTTATACTTA GCTTTTGTTG AAT CATC CAT GGCATCACTT 900 

AATCTCAACA ATGATTTCAT TACTGTTGCA GATGCTGAGT GACTGGTTTC ATTTTCACGA 960 

CTGATAGCTC TACCTCGTGA TAAATCCATC ATTTCACCTT TATAAATGAG TGGCATAAAT 1020

CCGTCGTCAA TCCATGACTT TAAGGTTGTA TCATTTTGGG TTTTATCATT AAAAGGTGTT 1080

TCTTTTATCA TCGGCATCAT TTGAGAAATC CCCTCTAAGA GTACAACGCC ATAAGCACCA 114 0 

GTGTATGGAA CGTCTTGATG ATCAATGTAA GAGCCATCTT TATAAAATCC ATTACGTTCT 1200

TTACCAGTGG CAGAATCTTG AACGTAAGTG AAGACTTTAT TAAATGAATC TATAGACTTT 12 6 0 

TTCATCATAT CTTTATCTTC TTCGATAATA CATTCTAAAA GTTTCACCTT AGAAATGTCT 1320

ACTAnATTTC CGCCTTTAGC AAGTTCAGnT TTTCCTACAC AAGATAATAT TT 13 72 

(2) INFORMATION FOR SEQ ID NO: 490:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 564 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490:



ACAACAATAT AGTAAAGCGT CGTTAATTAA ACAATGGGAC CAATTTGTTC GTCTTATATA 6 0 

AATGTACTTT aCCTTTATTT TTACAAAAAT AGCATTTTCC TATGTCATTT AACTAAACAT 120

GTAAGTTCGT ATGAACGAGG TTTGTTAAAT AGATGATTCT AGGAAAATGC TTTTTTCTTT 180 

TGACTTAGTT TAAAATATTT TGCCACTTTG TACTGATAGT AGTTGCATTG TACTGTTGTG 24 0 

CAGATTCTAT GCTATTAATT GAAAACTGTT GCAATTTTTG AGTATTATTT AGTAATTGGT 300

CGACCTTTTC AACCATTTGA TTGATGTCAC CTTGAGGTAC TAAATAGCCA TTAAATCCAT 3 60 

CTTGaATCAG TTCTGaTGGA CCATAATCTA CATCATAACT GATCACTGGT GTACCTACTG 420

AAAgCGACTC TAAAATTGCT AAGCCAAAAC CTTCCATTTT ACTTGTCGAT AACATCAGTT 480

CTGCTTTAGC AATCTCTTCA TTAATATGCG TCTTAAAACC ATGAAATTTA ACATGTTCCA 54 0 

GATATnATGA TAATCTTCTA CAAG 564 



(2) INFORMATION FOR SEQ ID NO: 491:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491: 

TATCCACCCC ACGAnAGCCC CGGAAACTTA TTGTGTTACA AGATATATAA GCAGAAACGA 60 

ACAACAGTTA ACAAAATAAA TGAAATTAAA CGTTTTAAAA ATGAAACAAA TGAAATCATC 120 

TATTAGGTTA TGAAACTGTT TATAGCTTGA ATAGAAGCAT TTATTTTTTA GGAGGACAAT 180 

TATTATGCGT CAAACATTTA TGGCAAATGA ATCAAACATT GAGCGCAAAT GGTATGTTAT 24 0 

CGATGCTGAA GGCCAAACAT TAGGTCGTTT ATCATCAGAA GTAGCATCTA TCTTACGCGG 3 00 

TAAAAATAAA GTAACTTACA CACCACACGT TGATACTGGT GATTATGTAA TCGTTATTAA 3 60 

TGCATCAAAA ATCGAATTTA CTGGTAACAA AGAAACTGAC AAAGTTTACT ACCGTCACTC 4 20 

AAATCACCCA GGTGGTATCA AATCAATCAC TGCTGGTGAA TTAAGAAGAA CTAACCCAGA 4 80 

ACGTTTAATT GAAAACTCAA TTAAAGGTAT GTTACCAAGC ACTCGTTTAG GCGAAAAACA 54 0 

AGGTAAAAAA TTATTTGTAT ATGGTGGCGC TGAACATCCA CACGCTGCAC AACAACCAGA 60 0 

AAACTACGAA TTACGTGGTT AATTAGAAGG AGGAAATGAC TTTGGCACAA GTTGAATATA 6 60 

GAGGCACAGG CCGTCGTAAA AACTCwGtAG CACGTGTACG TTTaGTACCa GGTGAAGGTA 72 0 

ACATCACAGT TAATAACCGT GACGTACGCG AATACTTACC ATTCGAATCA TTAATTTTAG 780 

ACTTAAACCA ACCATTTGAT GTAACTGAAa CTAAAGGTAa CTATGATGTT TTAGTTAACG 840

TTCATGGTGG TGnTTCACTG GACAAGCTCA AGCTATCCGT CACGGAATCG CTCGTGCATT 900 

ATTAGAAGCA GATCCTGAAT ACAGAGGTTC TTTAAAACGC GCTGGATTAC TTACTCGTGA 960

CCCACGTATG AAAGAACATA AAAAACCAGG TCTTAAAGCA GCTCGTCGTT CACCTCAATT 102 0 

CTCAAAACGT TAATTGTCGG ACGATATATA CAAAACACCT CGATATTATG TCGAGGTGTT 10 80 

TTTTTGGCGT TTTTGCGGCG AATATGGAAT GTGTAGAATA TAAATGAATT TTTACCTTCC 114 0 

CACCATAAAA GATGAAGAAC CATGAATGTG GAGAACAATA AATAGTTGGA TATTCTGTTA 12 0 0 

TTTTTTTGGA AGTGGAAGTG GATTTGGAAT ACTTTACTCn AAACGATTAA AAGGTTTAAA 12 6 0 

AAAACAACAA AnAGAAA 1277 
(2) INFORMATION FOR SEQ ID NO: 492: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492:
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TTATTGACAT TGTTTTTATC CAAAATTCAT TGTTAAGACA TTTTCTTTAT GAAATAATAA 12 0 

TnATATTGAA GTATATTTTT ATTATTATTA AAAATAAATA AGGGGATACT TATGAGCACA 18 0 

AATCAAACAT TTTTAATATT TGTTATAGCA ATTATTCTAC TTACATCTGT AATAGGAATT 240

GTTGGACGAT ACATGAGTCG TCAACGTCTA TTAAAATCTA TGGAAACATT ATGGCAAACG 3 00 

ATTTCTCCAT TAGAAGCTTT TATCAGACCG AACTCACATT TCGACTATGA GTATAAGCTC 360 

TACAAGGAAA AATTTGAATC ACATTCATTA GTTGATGATA AAACTTGGTC CGACTTAAAT 42 0 

ATGAATGCAA TCTTTCATAA GATGAATTAT AATTTAACAG CTATTGGTGA AATGAAGCTA 4 80 

TATGCCTGTT TACGTGGAAT GCTTTCAATT ACGAACAAAT CATTACTTAG TTTATTTAAT 54 0 

GATAATGCTG AATTTAGAAA AAACGTAACA TATCATTTAG CTTTGATTGG tAAAACTGTT 6 00 

aTCCAACATT TCCAGACCAA ATCACACCGG kAAACGTCCA AATATATTGn TCTATGCCCG 6 60 

GTTTACCAGT ATC 6 73 

(2) INFORMATION FOR SEQ ID NO: 493:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493:

AAAAAAAGTT AATGCTGACG GTGTATTAAC TTTTGATATT CTAGAAAATA AATATACTTA 6 0 

TGAAGTTATT AACGCTATAG GGAAAAGATG GATTGTTAGT CATGTCGAAG GTGAAAACGA 120 

CAAGAAAGAA TATGTAATAA CTGTCATTGA TAGGAAATCA GAAGGCGACA GACAACTGGT 180

TGAATGTACT GCTAGAGAGA TTCCCATAGA CAAGTTAATG ATTGATAGAA TTTATGTTAA 24 0 

TGTAACAGGA TCTTTTACAG TAGAAAGATA TTTTAACATT GTGTTTCAAG GTACTGGAAT 3 00 

GCTTTTTGAA GTCGAGGGCA AAGTTAAATC TTCAAAGTTT GAAAATGGTG GTGAAGGCGA 3 60 

TACAAGGTTA GAAATGTTTA AAAAGGGATT AGAACATTTC GGTTTAGAAT ATAAAATAAC 420 

GTATGACAAA AAGAAAGACA GATATAAGTT TGTATTGACG CCTTTTGCAA ATCAAAAAGC 480 

GTCTTATTTT ATTTCTGACG AAtCAACGCC AACGCTATAA AACTCGAGGA AGATGCAAGT 54 0 

GATTTCGCCA CCTTCATTAG AGGATATGGT AATTATTCAG GAGAaGAAAC ATTCGAACAC 600 

GCTGGGCTCG TAATGGAAGC TAGAAGTGCA TTAGCTGAAA TATACGGCGA CATCCACGCA 660 

GAACCATTTA AAGATGGTAA AGTGACTGAC CAAGAAACTA TGGATAAAGA ATTACAATCG 72 0 
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TATCCAGAAG CAGACCCACA ACCCGGAGAC ATAGTACAAA TAAAATCTAC CAAACTAGGT 84 0 

TTGAATGATT TAGTCCGTAT AGTACAAGTT AAAACGATTA GGGGTATAAA CAATGTAATT 900 

GTTAAGCAAG ATGTAACGCT TGGTGAGTTT AATCGAGAAC AACGATATAT GAAAAAAGTT 960 

AATACTGCAG CTAACTATGT TTCTGGATTA AATGATGTTA ACCTTTCTAA TCCTAGTAAA 102 0 

GCGGCAGAAA ACTTGAAGTC TAAAGTAGCG TCAATAGCTA AATCAACACT CGATTTGATG 108 0 

AGTAGAACTG ATTTGATTGA AGATAAACAA CAGAAGGTAA GCTCTAAAAC TGTGACTACA 114 0 

TCTGACGGCA CTATCGTTCA TGATTTTATa GATaAATCmA ACATTAAaGA TGTAAAAmCG 1200 

aTTGGAACGa TTGGCGATtC TGTAGCTAGA GGATCACATG 124 0 



(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494:

ACGGTGGATT TAGCCTAGAA GATTTAACGC ATCAAGGTAa ATTaTCAGCA TTTAGCTTTA 60

ATGATCAAaC AGGTCAAGCA ACATTGATTA CTAATGAAGA TGAAAACTTC GTAAAAGATG 120 

AGCAACGTGC TGGCGTAGAT GCAAATTATT ACGCTAAACA AACATATGAT TATTACAAAG 180

ACACATTTGG TCGTGAATCA TATGaCAACC AAGGTAGTCC AATTGTTTCA TTAACGCATG 24 0 

TTAATAACTA CGGTGGTCAA GATAACAGAA ATAATGCCGC ATGGATCGGT GACAAAATGA 300 

TCTATGGTGA TGGTGATGGT CGCACATTCA CAAGTTTATC GGGTGCAAAT GACGTAGTAG 360

CACACGAATT aACACACGGT GTGACACAAG AGACAGCGAA CTTAGAATAT AAGGACCAGT 420 

CAGGCGCTCT AAATGAAAGC TTTTCAGATG TTTTTGGATA CTTTGTAGAT GACGAGGATT 4 80 

TCTTAATGGG TGAAGATGTC TACACACCTG GAAAAGAGGG AGACGCTTTA CGCAGCATGT 540

CAAACCCAGA ACAATTTGGT CAACCAGCTC ATATGAAAGA CTATGTATTC ACTGAAAAAG 600 

ATAATGGTGG CGTACATACG AATTCTGGAA TTCCAAATAA AGCAGCTTAT AACGTGATTC 66 0 

AAGCAATAGG GAAATCTAAA TCAGAACAAA TTTACTACCG AGCATTAACG GAATACTTAA 720 

CAAGTAATTC AAACTTCAAA GATTGTAAAG ATGCATTATA CCAAGCGGCT AAAGATTTAT 780 

ATGACgAGCA AACAGCTGAA CAGGTGTATG AAGCATGGAA TGAAGTAGGC GTGGAGTAAA 84 0 

AATATATAAA CaAGAAGAAG TAATGTTAAA CACTTATAAA TAATTAAATT TTAAATACAG 900
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ATTAGATGAG AGGAGTGTGA GGGTTGTCTG CCGAAAGACT ACTCGGCAGT CTAAAATCAT 102 0 

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 1080 

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 114 0 

TAAATGAaAC ATGCATGAAT TTATTTTTTA ATACAAGAAA CGTAACTACC AAAGGAGTTT 12 00 

ACAATATGAA GAAAAGTAAA CGATTAGAAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 12 6 0 

TTTATAAAAA AGAACAAATc ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311 



(2) INFORMATION FOR SEQ ID NO: 495:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1761 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495:
TGCACTTTCT AAAAATACTT GCTTTACTTG TTCCAATTCC TTGTCAGACA TTGATTCGGC 60 

TAATTTATTC ATTATTGCCT CAAATACGTT AATTATGTCA TCCATTTCAG TACTATAAAA 120

ATCAAACCAT TTTGCCGTAT CTTTTTCTCT ATTAAGCTTA TGATCACTTT GAGAGCGCTT 180 

AGCTAATTCT GCATAAATAT ATGGACAAGG TGCCATTGCA GCAATTGTAT AAATAGCATT 240 

TTCACGACTA TGCGCTTGGA AATACATATG TTTTATGTAA TGGTCGCCAC TTGGAGGCCA 300

AACTTTTGTT TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG 3 60 

CGCAAGTACT TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT 420 

CATGCTATTC ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC 480

AGCATCAGCT TTTAAGTAAT GGCGTAATGC GTCAGcTCCT ATATCTCCGG ATAACATCTT 54 0 

CTGAATAAAG TCATCCTCAT AAATATCATT AATGATTGGC TTTGCAGCTT GGTACAATTT 600

TTGTGAAAAT TCCATTGTAA AAAATCCTCC CTAAATAAAA AAACTACTTC CAACATGAAA 660

GTAGTTTGAT GGCAATGTTG CTATACTAGC CCCATCACTT CAATAACTAC TTTCCTACGT 720 

TGGTACTAAC CAAATCAGGT CATAAGGGTC TGAACAATTC ATCTCAGCCA TATCATTAGG 7 80 

CTCCCCTAGT AGTTCCTTAG TATTCAATTG CAAATTAATC TTAGCAAACG GTTTCAACAT 84 0 

TTTCAATTAT TGTTGCTCAG TTGTATTATT ATCTTTAAAT AATAATTCTA TAATGACATA 900 

TATTTGCGAA ATAAAAAAAC CGGAACATAT CGAGAATTCC CCGATATATT CCAATCTAAA 96 0 

AGTTACTTAT ATAACTATTA ATTAGCTATG CATAAATGGC TTATGCAGTA ACCCAATGTC 102 0 
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TTGCTGGTGA TACACCTTTA TATTTAGCAG GTGCTACTGA ATCCCAAGTT GATTGTAAGA 114 0 

ATTGATACTT ACCAGCTGCA CCTGgATGTT GGrTTTACAG CATGAATATT GCCACCTGaT 12 0 0 

TCACGTTGrG CAATTTGTTT TAGATGAGCa TTCACATTTA CTGATGAACC TTCTGATGAT 1260

TTTGATyCAG TTGGTGTTGC AGTAACTTGT GAATTGTTTG ATGTTGATGC TTGTGGTTGT 13 2 0 

TGAGTTTGAG CATTTTGTGG TGCTTCAACT TCTTGTGATT GTACTTGATT AGCTTGAACA 13 8 0 

GCTGATGGTG CAACATTATT AGTTGCAGGT GCTTGTGCAC TCATGTCTGC TCCATTAGTA 1440

CCTGTTGCAT GGTAATTCCA AGCAAAGTGT GTACCATCTG ATTCAAAGTG ATAAGTAAAC 150 0 

CCTTCATAGT CAAATGTATA ATTATAAGCC CCAGCTTCAA TTGGTTTTTG ATTTAATGTT 156 0 

TGATCATTTG ATTGCGCCAT TTGCCTGAAA GATGCTTTAT TTAAGTCCGC TTCACnTGCA 162 0 

TGGGCTTCGT GGACCTGCAT TTCCTGGCTA CGATTCCTAA ACCTACTGGC nAAnGATGAT 16 8 0 

GCGAGTAATG TTTTCTTCAT AATCTTAAAA TCCTCCTACA AGTGAATTTG TGTCTCTAAA 1740

AGTTTTACAG TGGACGACTG T 1761 

(2) INFORMATION FOR SEQ ID NO: 496:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 794 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

TCATTTATGA AAAATGTCGG AnAGAnCCaa GaAAmaCAAT TGAgCGTGAA GAAAAAGCAA 6 0 

GACTTAAAGA AGAACAAAAG GCACGTCAAA ATGAACAGCC ACAAATAAAA GATGTGAGTG 120

ATTTTACGGA AGTGCCTCAA GAAAGAGATA TTCCAATTTA TGGGCATACT GAAAATGAAA 18 0 

GTAAAAGCCA GAGTCAACCA AGTCGAAAAA AACGAGTGTT TGATGCAGAG AATAGTTCGA 24 0 

ATAACATCGT AAATCATCAT CAAGCAGATC AGCAAGAACA ATTAACAGAA CAAACTCATA 300

ACAGTGTTGA AAGTGAAAAC ACTATTGAAG AAGCTGGTGA AGTTACGAAT GTATCGTATG 36 0 

TTGTTCCACC GTTAACTTTA CTTAATCAAC CTGCAAAACA AAAAGCAACA TCTAAAGCTG 42 0 

AAGTGCAACG TAAAGGACAA GTACTAGAGA ATACATTAAA AGATTTTGGG GTAAATGCAA 4 80 

AAGTGACACA AATTAAAATT GGTCCTGCAG TAACTCAATA TGAAATTCAA CCAGCTCAAG 54 0 

GGGTTAAAGT GAGTAAAATT GTAAACTTGC ATAATGATAT TGCATTAGCT TTAGCAGCAA 60 0 

AAGATGTTAG AATCGAAGCG CCAATACCTG GTCGTTCTGC AGTAGGTATT GAAGTGCCAA 66 0 
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ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT GTTCCACTAA 780 
ATGAAATGCC ACAC 794

(2) INFORMATION FOR SEQ ID NO: 497:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:

AGCCAGTTTT GcATTTCGTC AAATCGCAAT GAATATATTG ATTGCTTGTC AAAATTTGGA 60

AGAAATTTAT TCTTCTTTTT CTTTTCAGCA GTTATCATTC CTAATTGTCC TTTCTTTTTA 120

TCTTAGTGAT AAAGAATCCA TCTGAATTAA AGTCTTGCGG CATGATTTGT AACGTTTTGA 180

CCAACTCTCC AGTTATCGGA TGTTGAAACG GTTCAAATTC GAAGTTTTTA TTATTTTTCA 240

AAAACGTATA AATCACGTTT TCATTTTCTA GTTGCTCAAT TGTACATGTT GAATAGATGA 300

TTTCTCCACC TATTTTTACA TTGTTTTTTA CATTTTCCAA TATTTCAAGC TGTAATTCAA 360

CTAGTGACTC AATATGTTGT TTGCTTTGAG TATACTTAAT CTCCGGCTTA TGTCTCATTA 420

CACCTAATCC GCTACATGGT GCATCAACAA GTATCTTATC GTATGTTTTA TCATAAGGTT 480

TTGTCGCATC ATGTTGAAAA GCTTTAATAT TTGTTAATCG TAATTnTTTT ATATTAAAAT 540

TAATTAAGTC TATTTTGTGA TCATGTATAT CTGAAGCGTC AACTTGCCCT TCTGGCATTA 600

AAACTTCAGC AATGTGACAA GCTTTACCGC CAGGTGCACT ACATGCATCT AATACGTGAT 660

CATGTCGGTC TACATTCATA ATGTGTGCAa CAAACATTGA GCTTTTATCT TGAATTGAAA 720

CGAATCCATC TTTAAATGAA CGAGAATGAA TAATTGGTTG TCCTCCTATA TGGAGACAAT 780

AAGGTAAGTC ATGATCTTTT TCAACGTCAT AACCTTCGTC TTGCAACTTT TCAATAATAT 840

CATCTAATGA TGCTCGCGTC AGGTTGGCAC GCACAGTTGT TGATGTCGTT TCTAAAAATG 900

ACTGTAAAAT TTTTTCAGTT TCTTCGAGAC CATAATGTGT TGCCCAATGA TCTATAATCC 960

ACTTCGGCAT ACTATACTCG ATTGCCATTC TTTTTTTAGG ATCTGCAATT TCATTAAAAT 1020

CAGGTAAGTC ACTACGCATC ATTGTACGTA AAATACCATT TACGACATTA CCATTATGAT 1080

AGCCACCGCG TTCTTTTGCT ATTTCAACTG CTTCATTAAT AATGGCATGA TTTGGAACTT 1140

TATCTAnATA nACATATTGA T 1161

(2) INFORMATION FOR SEQ ID NO: 498:
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w 





(A) LENGTH: 1504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 

AGCTCACGTC ATCTTCGGCG GCGCTAAATT AAAATAATCA ATTTCTGAGT TAAACTTTTA 6 0 

TTTACAACAT ACTATTACTA TACATTACAA ATTTTTAAAA TATACATTAC ACTCATTACT 120 

CAATGGmAAG CGTATGATTT CmCAGCCCCC CTAGCTTGTA GAAATCATAC TTTCCTTTTT 180

TCAATATATA TACAACTATT AAATCCCATA AGATTGCAGA GCACATAAGT AAATTTTTTT 240

AGAGCTTGAG GTTTGTTTAG CTTAAGCAAC CCATGAGCTC AAACACTTCC TGTTCACATA 3 00 

ACACTACAAA TCGCATTATG TTGCTTAATC TTATGTTTAT ATAAATTACA CACAATAAAT 360 

AGAAAGAATG TGAACATCAT GAATAAATTA TTGCTACTCG TTACATTTAT CATTCGTGTG 4 20 

GGTTCAGGTA TTGTTATGTT AATGCAAGGC TACGAAAAAT TAACGGGCGG ATTTACGCTG 4 80 

AAAGGTTTAG TACCAGTCAT CGCTAACAAT ACTGATTCAC CAGAGTGGTA TAAGTGGTTT 54 0 

TTCGCAAATA TAGTTGCACA TACGACGTCA TTATTTGATA TTGTTGTCCC ACTCGGAGAG 600 

ATTGCAATTG GATTAGGTTT AATTTTTGGA GTTTTTGCAT ATGCTGCTAG TTTCTTTGGA 660 

GCCTTTGTTA TGATAAATTA TATCTTAGCA GATATGATAT TTACGTATCC TCTTCAATTA 72 0 

ACTTTCTTTA TCCTTTTACT AATGAGTCAC TCATTGTTAA AACAGATTTC ACTTAAAGAA 7 80 

ATCATTAATT ACTTTAGAGG TCGTAAGAAC AGAGGTGAAA AAATAGATGA CCCACTTACT 84 0 

GATCGTGGAT GATGAACAAG ACATTGTAGA CATTTGTCAA ACCTATTTTG AATATGAAGG 9 00 

TTACAAAGTA ACAACGACAA CTAGCGGTAA AGAAGCAATT TCTTTACTAT CAAATGATAT 960 

TGATATCATG GTACTTGATA TCATGATGCC AGAAGTTAAT GGTTACGACA TTGTCAAAGA 1020

AATGAAAAGG CAAAAATTAG ATATCCCCTT TATCTATTTA ACTGCCAAAA CACAAGAACA 10 80 

TGATACCATT TACGCCTTAA CTTTAGGTGC AGATGACTAT GTCAAAAAAC CATTTAGTCC 114 0 

AAGGGAACTC GTTTTACGTA TTAATAATTT ACTTACAAGA ATGAAGAAAT ACCATCATCA 12 00 

ACCAGTTGAA CAACTGTCGT TTGATGAATT AACACTTATT AACTTAAGTA AAGTtGTGaC 12 6 0 

tGTAAaTGGT CACGAaGTCC CTATGCGTAT TAAGGAATTT GAGTTATTGT GGTATTTAGC 1320 

TTCTaGAGAA AATGAAGTTA TTTCTAAATC AGAATTACTT GAAAAAGTTT GGGGATATGA 1380

CTATTACGAA GATGCTAATA CCGTGAATGT CCATATACAC CGTATTAGAG AnAAATTAGA 1440

AAAAGAGAGC TTTACAACAT ATACCATCAC AACTGTATGG GGATTAGGAT ATAAATnTGA 15 00 
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(2) INFORMATION FOR SEQ ID NO: 4 99: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499:

ATTGAAAGCG ATAATTCGTA nTAATTGAGT TTGTTGAAAA ATTTAGGGTA ATGTAAAGAT 60

ATAAAAGATA CATAGAyTGG AGAGATATAA AGATGTTGAA TGAGATACAA ATATTAaATA 12 0 

aTGGATACCC GATGCCTTCA GTTGGGTTAG GTGTTTATAA AATCTCTGAC GAAGATATGA 180 

CTAAAGTTGT AAATGCTGCA ATTGACGCAG GCTATAGAGC GTTTGATACA GCATACTTTT 240 

ATGATAATGA GGCTTCACTA GGACGAGCAT TAAAGGATAA TGGCGTCGAT AGAGAAGATT 300

TGTTTATAAC AACGAAGTTA TGGAATGACT ATCAAGGTTA TGAGAAAACA TTCGAATATT 3 60 

TCAACAAATC GATTGAAAAT TTACAAACTG ATTATCTTGA TTTATTTCTA ATACATTGGC 420 

CTTGTGAAGC AGATGGTCTA TTTTTAGAAA CATATAAAGC TATGGAAGAA CTTTACGAGC 480

AAGGTAAGGT AAAAGCAATA GGTGTATGTA ATTTTAATGT TCATCATCTA GAAAAATTAA 54 0 

TGGCTCAATC AAGTATCAAA CCAATGGTGA ATCAAATTGA GGTACATCCA TATTTTAACC 600 

AACAAGAATT ACAAGAATTT TGTGATCGTC ACGATATTAA AGTGACTGCA TGGATGCCTT 660

TGATGAGAAA TAGAGGACTA CTAGACGACC CTGTCATTGT TAAAATTGCT GAAAAATATC 720

ATAAAACACC AGCACAAGTT GTATTACGTT GGCATTTAGC ACACAATAGA ATTATTATTC 780 

CAAAATCTCA GACACCTAAA CGCATTCAAG AAAATATAGA TATTTTAGAT TTTAATTTAG 840 

AATTAACAGA AGTAGCTGAA ATTGATGCTT TAAATAGAAA TGCAAGACAA GGTAAAAATC 900 

CAGATGATGT GAAAATTGGG GATTTAAAAT AACTGGATGT TAAATTTTAC GTTTATGAAT 960

GCCTTTTAAT GTGTACATTA AAATAAATGA GTTGGTTTTT ACTATTTGAT AAAACAATAC 1020 

TCAGGTACAT TCAAAATCTT TTAAATAAAA AGGATGGACA TAGATGAAAA TTAGAGTCGT 1080 

CATTCCTTGT TTTAATGAAG GGGAAGTCAT TACACAAACA CATCAACAAT TAACTGAAAT 1140

ACTTTCACAA GATAGTAGTG TGAAAGGCTA TGATTATAAT ATGCTTTTCA TAGATGATGG 1200

TAGTACGGAT ACCACTATAG ATGAAATGCA ACATCTTGCC ACAATAGATA GGCATGTCAG 1260 

CTTTATTTCT TTTAGTAGAA ATTTTGGAAA AGAAGCAGCT ATGATTGCAG GTTACCAGCA 132 0 

TAGTACTGAA TTTGATGCAG TCATCATGAT AGATTGTGAT TTGCAACATC CACCTGAATA 13 80 
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TAGAAGTGGT GAAAATTTTA GTCGCAAAAC ATTAAGCCAT TTGTATTATA AGTTAGTTAA 1500 

TTGCTTTGTA GAAGAAGTAC AATTTGATGA TGGTGTTGGT GATTTTAGAC TTTTAAGCCA 1560 

AAGAGCTGTT AAATCCATTG CATCACTTGA AGAATATAAT CGnTTTTCAA AAnGGnTATT 1620 

TGA 1623 



(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

AAAGTnGGTG AAnCTATATA CTTAATCTAT ATTTATATAT TAACCATTAG GGTTAAAAAT 60

TACTCTAGCA TTTATGAATA GATGGGAGTT TATTTTATTA TTATATAGGA GAGATGTTGA 120 

ATGACACATC GCGCACTATT AGTTGTTGAC TATTCATATG ACTTTATCGC AGACGACGGC 180 

TTACTAACAT GCGGTAAACC TGGACAAAAT ATTGAAGATT TTATTGTTTC TCGTATCAAT 240

GACTTTAATT ATTATCAAGA CCATATATTC TTTTTrrTGG ATTTACATTA TTTACATGAC 3 00 

ATTCATCATC CTGAAAGTAA ATTATTCCCA CCACACAATA TCGTAGATAC AAGTGGTAGA 360

GAATTATACG GTAAAGTAGG TAAATTATAC GAAACAATTA AAGCGCAACC TAATGTACAT 4 20 

TTCATTGATA AAACGCGCTA TGATTCGTTC TTTGGTACCC CGCTTGATAG TTTATTGAGa 4 80 

GAAAGAAGTA TTAATCAAGT CGAAATCGTT GGTGTATGTA CCGATATTTG CGTGTTACAT 54 0 

ACAGCAATTT CTGCATACAA CTTAGGtTAT AAAATTTCAG TACCTGCTGA GGGAGTGGCT 600 

CATTT 605 
(2) INFORMATION FOR SEQ ID NO: 501:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1739 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 
TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 6 0 

GTGTAAGATA TAGTGCAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 120 



EP0 786 519 A2 



TGAGAGAAAA GTTTAGACAC TATTGTGAAC ATGAAGTTCT AAGTTCAATC ATCAATGGTT 24 0 

CATACATTAT CGTCAAAACC TCACCTGGTT TCGCCCAAGG CATAAACTAT TTTATCGATC 3 00 

AGCTAAATAT AGAAGAGATA TTAGGTACGG TGAGTGGAAA TGACACTACA TTAATCTTAA 360

CTGCCTCAAA TGATATGGCA GAATACGTAT ATGCAAAATT ATTTAAATAG ACATGTATCA 42 0 

AATGAATAAT AAAAATTTGT TTCGTATCAC GTGTACTCAA GTTAGTTACC AAATATTAAC 4 80 

TTGTGTACGC GTTTTTTTAT GGAAAGAAAG AATTCATAGT CATTCAATTG ACTGTATAAA 54 0 

AAACTTTATA CAACATGTTT TTATGGGTAT TTTTGAATAA AAAATGTATA TTTTGACCCA 6 00 

AAATACCTTT ATTTATGTAT AAAAATCCAT TATTATGTAT TGTATAACAA AAAGATATGA 660 

AATTTTCGAC TTTCTTTATG TGAATATAAT CACATGTAAG CGTTTGAAGA TTGTCTATAC 720

TCTAAATGAA TTCAAAGATA AAAGGAGGAA ATAGACATGA CAGATGGTCC AATTAAAGTA 78 0 

AATAGCGAAA TTGGAGCTTT AAAAACTGTG TTACTTAAGC GTCCTGGaAA AGAATTAGAA 840

AATTTAGTAC CTGATTATTT AGATGGATTA CTATTTGATG ATATTCCATA TTTAGAAGTA 90 0 

GCTCAAAAAG AGCATGACCA TTTTGCGCAG GTGCTAAGAG AAGAGGGTGT TGAAGTACTT 960 

TACCTTGAGA AGTTAGCAGC TGAAAGTATT GAAAATCCTC AAGTAAGAAG TGAATTTATT 1020

GATGATGTAT TAGCAGAGTC TAAAAAAACA ATATTAGGTC ATGAAGAAGA AATTAAGGCA 1080

TTATTTGCGA CACTTTCTAA TCAAGAACTT GTAGATAAAA TAATGTCAGG GGTACGTAAG 114 0 

GAAGAAATTA ATCCGAAATG TACACATCTA GTAGAGTATA TGGATGATAA GTATCCATTC 1200

TATTTAGATC CAATGCCAAA CCTTTATTTT ACTAGAGATC CACAAGCCTC AATAGGACAC 126 0 

GGTATAACAA TCAATCGGAT GTTCTGGAGA GCACGACGAC GAGAATCAAT ATTTATTCAA 1320 

TATATTGTAA AGCATCATCC TAGATTTAAA GATGCGAATA TTCCAATCTG GTTAGATCGA 1380

GATTGCCCAT TCAATATTGA AGGCGGCGAT GAACTTGTTT TATCTAAAGA TGTCTTGGCT 14 4 0 

ATAGGCGTTT CAGAACGTAC ATCTGCACAA GCTATTGAAA AGTTAGCGCG ACGTATTTTT 1500

GAAAATCCGC AGGCGACGTT TAAAAAAGTA GTAGCAATTG AAATTCCAAC TAGTCGAACT 156 0 

TTTATGCACT TAGATACAGT ATTTACAATG ATAGATTATG ACAAATTTAC AATGCATTCA 1620 

GCCATTTTAA AGGCAGAAGG CaATATGAAT ATATTTATTA TTGAATATGA TGACGTAAAT 1680

AAAGATATTG CCATCAAACA ATCTAGTCnT TTAAAAGATA CTTTAGAAGA CGTACTAGG 1739
(2) INFORMATION FOR SEQ ID NO: 502:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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w 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 

CTGTACATAC AGCAATATCG TTAACAAATG AAAACAGTAT TTTAGGATTG TAAACCATGA 60

TAAACCTAAA ATACTGTTAT TTTTATTACT TAAATTTCTT CTTCAATGCC TTTTCAACAT 120 

AAGGTGGAAC GAATTCAGAA ATATCTGCTC GATAAGCTGC AACTTCTTTA ACAATACTTG 180 

AACTTATAAA TGAATAATTA GTACTAGACA TCATATATAA CGTTTCAATT TCATTGTTCA 240

ACTTTTTATT CATTGAAGTT AAGCGTAATT CATATTCAAA ATCACTGACT GCTCTTAAAC 30 0 

CACGTATGAT TGTTTTAGCT CCTACTTGTT CACAATAATC GACTAGTAAA CCACTAAATT 36 0 

GATGAACCTT GACATTAGGT AAATGTTTAA CAGATTGTTC AATTAAATCC ATACGCTCTT 42 0 

CTAAACTAAA CGTACCTTCT TTTTTACTAT TTTTAAGAAC ACAGACATGA ATTTCATCAA 4 80 

ATCTATCTGT ACTTCTCTCA ATAATGTCTA AATGACCATA AGTAATGGGG TCAAAACTAC 540

CCGGAATGAC CGCTATTGTA TGTTCCATGC TATTCTCCCT TTTCTAATAA CAATGTGTCT 60 0 

GTCAACCCAT AATGGTAACG TTTAATCATA TTAAACGGTT GATAATCtAT TTCTTCATGA 66 0 

TTGCTAAATT CACAAACGAT GATACCATTT TCTTTCAATA AATTAAACTC TGAAATTAGT 720

TTTAAAGCTT TATCAATGAG ACCTTTATTA TAAGGTGGAT CTAaGAAAAT GACATCAAAT 78 0 

TGAATATCAC GTTTTGACAA TGCTTTTAAA GCTCTATCTG CATTATTTTT ATAAACTTCA 84 0 

GATTGTGCCT CTAAATCCAA ATTCGCAAGA TTTGaTTTAA TAACTTTTAC AGCTTTAAAA 900

TTTTGaTCAA CAAAGATTAC CTTATCCATA CCTCGAGAGA GTGcTTCTAT TCCAAGCGCC 960 

CCGCTTCCTG CAAATAAATC TAAACCTATA CCTGACACAT CATATAAACT ATTAAAGATA 1020 

CCTTCTTTAA CTTTATCCAT AGTTGGTCTC GTATTACGGC CTTCCATACT TTCTAAAGCT 1080 

TTACTTTTAT GTTTACCTGC AATGACGCGC ATGTTGTTCA CACTTCCAAT TCATTTAGTT 114 0 

ATTTAATATA ATTTATTGAG AAAAAGGAGA ATGATAAACC AATGAAACAA ACATTTATTA 12 00 

CACTTGGTGA AGGTCTAACA GATTTGTTCG AATTCATGAC GATGATTGAA TATAACCATC 12 6 0 

AACGTATTGA TAAAATTATC TATTTTCATT CACCACAAGC TGAAAATAAA AAGTCATCTG 13 20 

TAGCAATCAT TATGAACCCT ACAACTGGCA ATCATTTCCA AGCATTTTAT ATCATGATAA 13 80 

ACGCTATTAA ATATCCATAT CCAGATTCAA ATAAAAAGTT TCAAATGATA AATGATTGTG 14 4 0 

CTGAAAAATT CGACATACCA ATTTTAGGTA TCGATGTACA GCCCCCTCAA GCATTTCATG 1500 

ATTTATCGTT ATATTATAAT TATTTAATTA GTGTGTTAAG GCTCCAAAAA TGGATACCAG 15 6 0 

aACTTCAATA ATATTAATTA TATATTTCGT GTTTCTCTTT TTCGTAAGTT TTCTTTAAGT 16 2 0 
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TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATTTGAAT 174 0 

GAACG 1745 
(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503:

TCGTCTTTAA TCTTGCTGAC TTTAGAAGGC TTACGAGTGC GGTTACCATT TTTTGCATCT 6 0 

TTAACTGATT GAACTAAAGC TtGACGTGTA GATTTATCAG CTAAACTAAT TGCACCACCA 12 0 

ATTACGGCAC CAATTAAAAT ACCAGGAACA AATTTATTTT CCATAAAAAA CTACCCCTCT 180

TTCAAATTTG CATCTTTTAC GATGTAGTCT ATTAAATTAT CACAAGATGA TAATACCATG 24 0 

TCGTATACAC CTTCAAAATT ATTCGTGTAG TATGGATCTG GTACATCACT CTCTTCCATA 3 00 


TTACTAAATT CTAACAGTTT GAACAATTGT CCCTTAAGAT TAGGATTGAT AGATTTAATA 360 

TTATCAACGT TACTTTGATC CATAGCCACA ATGTAATCAA AATCATCTGT CGCTTCGAAT 42 0 

AATTCACTAA TCATGCCATC AAATGGAATA TTGTGTTTGT TGAGAATTTT TTGTGTACCT 4 80 

TCATGAGGTG GCTCTCCTAA ATTCCAGCTA CCAGTACCTC TTGAATGTAC TTTAATATCA 54 0 

TGAATATTTC TGTCTTTAAG TCTTTGTCGC ATGATTGCTT CTGCCATTGG AGAACGACAT 600 

ATATTGCCAA GACAGACAAA TGCTACATCT ACCATTTTGA TTCCTCCAAA CTATGTAGTT 660

ATATCCCCAT TTTATAGCGA CTTTAAACAA TAAGAAAGCA GATTATATAA AATTCTATTA 720

AAGTTTATTA AATTGTGATA CTTTGATAAC ATAACTATTA TTAGAGGTGA ACATTGTGGC 7 80 

TATGACAAAT GAAGAGAAAG TnTTAGCTAT TAGAGAGAAG TTAAATATtG TTAATCAAGG 840

ATTATTAGAT CCTGAAAAAT ATAAAAATGC AAATGaAGAA GAATTAACAG ATATATATGA 900

TTTTGJcTCaA yCAAGAGaAA GATTGTCGCC AAGTGaAGTG mCAGCTATTG CTGaCGCTTT 960 

AGGACAATTG CGACACGAAT AGGAGTGGGA ATTTTGACTA ATTACAAAGA AAAGTTACAA 1020

CAATACGCTG AACTA 1035
(2) INFORMATION FOR SEQ ID NO: 504: 



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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double 
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15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504:

AnCCTGACAA GATTCTTATA TTCATAGATA TGAGAGCTAA AAATGAAATC AACAAACAnA 60

CAATAAAGTA AACGATAATA GCCCATATAC CATTTTGTAA CCATATTACA AATTGTGTTG 120 

TATTATAGCC ATTTCCAGCT AATAATTGCT GGATAAATGC ATTATTGTTT AATGTATTTT 180

CAAGATTAGC AAtCGATGTG TTATTACTGA ATGAAACAAG TGCTATAAAC ATCGTAATGA 240

CAGTAAGCAC TAATAACATC ACCCAACATA ACCAACCTAG AACTTTTTCA GTTAATCTAC 300

TTACTGGACG TTTAATTTGA GTAAATTGTT CTCCAGTCAT TCGTTACAAC TCCTTATAGT 360 

ACTTATCCCG TTATTATAAC TAAATATACA GTAAATAACT ACTATTTATG ATTTTATTTT 420 

AATGACATT7 TGAAATTCAA AAAGTTTTCA TTGTATTCAC TTAAAACTTC AGGTCCTAAA 4 80 

TCTTTATAAA CTTCAAGGCG TTCTTGCTCT TTCTTAGTCG GATAAAAACG ATGGTCGTCT 540 

TTAATCTCTT TAGGCAACAA TTGTCGAGCA GCCTTGTTTG GCGTTGCATA GCCTACGAAT 6 00 

TCTGTATTTT GCTTGTTATT TTTAGCATCT AATAAAAAAT TCATAAATTT ATATGCACCC 660 

TCTTTATTTT GTGCCGTTTT TGGAATTACC ATATTGTCGA ACCATAAATT CGATCCTTCT 720 

TTAGGAATAA CATAATTATA TTTATCCCCT TCTTGCACTA GAGGTGCTGC AACACCACTC 780 

CAAACAACCG CTATGTTACC TTCATTTTGT TGAAGCATCA TGGTAATTTC ATCACCTACG 840

ACACCTCTTA CTTGTGGTGC TAGTTTGGTT AAATCTCGCT CTGCTTCTTT TAAATGGTGC 900 

GAATTACGGT CATTAAGATT ATACCCAAGT TTATTCAAAC TCATGCCTAT AATCTCTCTA 960 

GCACCGTCAA CTAGTAAAAT TTGGTTTTTA AATTTAGGAT TATACAATGA CTTCCAACTA 1020

TCAAATGATT CATTTGGATA CTTTTCTTTA TTATATAAAA TACCTACAGT TCCAAAGAAA 1080 

TAAGGTAAAG AATATTTATT GCCTCTATCA AATGACATAT TCATATAATC TGAATCTAAA 114 0 

TTTTTAATAT TAGGTACCTT ATTATGATCT ATTGGTAACA ATAAATGATC TCTTTTCAAT 12 00 

TTTTGAACTG nATATTCACT AGGAnAAGCA ACATCATAAT GTGTACCGCC AGTGCGAATT 1260 

TTGGnGTCCA nCGCTTCATT TGAA 1284 
(2) INFORMATION FOR SEQ ID NO: 505:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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ATAATTAACA TCTTTTGCTA TATACCACCA GTTTGATACA TAAAATATCG CAGCAATAAT 60 

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 12 0 

CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 180

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 240

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGCCACCT GtCAACCATT GCTTATTTAA 3 00 

GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATCGAGCC CTGGAATATA 3 60 

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 420 

TAAAACTGTT CTTCTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 480

AATGCATATG TATATTATTG AATTACGACA TCATCAAAAT CATATTGACT AAAATACTGT 54 0 

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATGCAACTC 6 00 

GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCAATT 660

TATTATAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 7 80 


AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 84 0 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 900 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 96 0 


AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020 

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080 

AATTTGAGGT CCCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 1140

TTCATCATCA ACATTCACTT TGTATACTAA ACCCAAATTA ACGATATCAA TTCCTAATTC 1200 

AGGGTCAATT ACCATTTCTA ATGCACCTAA GATACTATCT TTCAATGCCT CTTCCATCCA 1260 

TATCACCTCT TTAATGTCAT ATTATTCATA ATATATCAAA TATCCGACAA AACGCCAATA 1320

AAATGCTATG ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC 13 80 

ATTTAATATG TCTAGACTTA GACGGAACAT TATTAAACGA TAACAAAGAA ATTTCATCAT 1440

ATACTAAACA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 150 0 

GCAGACCTTA TCGTGCAAGT CAAATGTATT ATCATGAATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 

AAATTTTAGA TTTAGGCATC GCACAAAACA TTATTCAAGG ATTACAACAA TATCAAGTAT 16 80 

CGAATATTAT AGCAGAAGTG AAAGATTATG TTTTCATTAA CAATCATGAT CCAAGATTAT 174 0 



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AAGAATCCCC TACCTCAATT TTAATTGAAG CCGAAGAAAG TAAAATACCT GAAATCAAAA 186 0 

ATATGCTTAC TCATTTTTAT GCCGATCATA TTGAGCATCG ACGCTGGGGC GCACCATTCC 192 0 


CTGTCATTGA AATTGTAAAA CTTGGTATTA ATAAAGCAAG AGGCATTGAG CAAGTTAGAC 19 80 

AATTTTTAAA TATTGACCGA AATAATATTA TTGCATTCGG TGATGAAGAT AATGATATTG 2 04 0 

AAATGATTGA GTACGCGCGT CACGGTGTTG CTATGGAAAA TGGTTTGCAA GAACTTAAAG 2100 


ATGTAGCGAA CAATATTACA TTCAACAATA ATGAAGATGG CATTGGTCGA TATTTGAATG 216 0 

ATTTCTTTAA TTTAAATATT AGATATTACT GTTAATTTAT AACTAATCAT TTTATAATAT 222 0 

TTTAAAACAA TAGGAGGTAA GTTACGATGC CCAAAATAGT CGTAGTCGGA GCAGTCGCTG 2280

GCGGTGCAAC ATGTGCCAGC CAAATTCGAC GTTTAGATAA AGAAAGTGAC ATTATTATTT 2 34 0 

TTGAAAAAGA TCGTGATATG AGCTTTGCTA ATTGTGCATT GCCTTATGTC ATTGGCGAaG 24 00 

TTGTTGAAGA TAGAAGATAT GCTTTAGCGT ATACACCTGA AAAATTTTAT GATAGAAAGC 2460

AAATTACAGT AAAAACTTAT CATGAAGTTA TTGCAATCAA TGATGAAAGA CAAACTGTAT 2 52 0 

CTGTATTAAA TAGAAAGACA AACGAACAAT TTGAAGAATC TTACGATAAA CTCATTTTAA 2580


GCCCTGGTGC AAGTGCAAAT AGCCTTGGCT TTGAAAGTGA TATTACATTT ACACTTAGAA 2 64 0 

ATTTAGAAGA CACTGATGCT ATCGATCAAT TCATCAAAGC AAATCAAGTT GATAAAGTAT 2 700 

TGGTTGTAGG TGCAGGTTAT GTTTCATTAG AAGTTCTTGA AAATCTTTAT GAACGTGGTT 2 7 60 


TACACCCTAC TTTAATTCAT CGATCTGATA AGATAAATAA ATTAATGGAT GCCGACATGA 2 820 

ATCAACCTAT ACTTGATGAA TTAGATAAGC GGGAGATTCC ATACCGTTTA AATGAGGAAA 2 880 

TTAATGCTAT CAATGGAAAT GAAATTACAT TTAAATCAGG AAAAGTTGAA CATTACGATA 2940

TGATTATTGA AGGTGTCGGT ACTCACCCCA ATTCAAAATT TATCGAAAGT TCAAATATCA 3000

AACTTGATCG AAAAGGTTTC ATACCGGTAA ACGATAAATT TGAAACAAAT GTTCCAAACA 3 0 60 

TTTATGCAAT AGGCGATATT GCAACATCAC ATTATCGACA TGTCGATCTA CCGGCTAGTG 3120

TTCCTTTAGC TTGGGGCGCT CACCGTGCAG CAAGTATTGT TGCCGAACAA ATTGCTGGAA 3180

ATGACACTAT TGAATTCAAA GGCTTCTTAG GCAACAATAT TGTGAAGTTC TTTGATTATA 324 0 

CATTTGCGAG TGTCGGCGTT AAACCAAACG AACTAAAGCA ATTTGACTAT AAAATGGTAG 3300

AAGTCACTCA AGGTGCACAC GCGAATTATT ACCCAGGAAA TTCCCCTTTA CACTTAAGAG 336 0 

TATATTATGA CACTTCAAAC CGTCAGATTT TAAGAGCAGC TGCAGTAGGA AAAGAAGGTG 34 2 0 


CAGATAAACG TATTGATGTA CTATCGATGG CAATGATGAA CCAGCTAACT GTAGATGAGT 3480

TAACTGAGTT TGAAGTGGCT TATGCACCAC CATATAGCCA CCCTAAAGAT TTAATCAATA 3540

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GTTAGAATTA TGTTGGACTG GTACTACTAT CCAGTCCATT TTTTATGTTT AACATTTTTA 36 60 

GAATCAAAAA AGACATAAGG TCTTGGACTA ATAATTGTCC ATGCCTTATG TCATATACTA 3720


TATGTCTTAT CAATTAGCCA ATACCGAATA ATTTTGATAT AGGsCCTAAC GGTAGAATGA 378 0 

CACCTAATAC CATTGTGATG ATAATTAATG CAATTGTTAT CCAAAACATT GTGTGACTTT 384 0 

GTTCATGTCT CTTTCTTTTA GCAATCGACA CTTCCATCAA TCCAACTACT GCAACACCAC 3 9 00 


ACAGCATTTT CAATGTAAGC AACATATGAT TTGCCCCGCC ATTCATAAAT GACTGAATTA 3 96 0 

ATATCCAAAA TCCTGAAATT AACGTCAACA GCATAAATAA GCGTAAAATC ATGTGCAACG 4020 

GTTTGAAAAA TGGTGATCTG CCTTGATTTT TTGAAATGTT TAAGTATGTA GCGATAAATA 4080

AAATAATCGC TAATACCCAA CTTAATATAT GTAAATGTAA CATACTGATT CCCCCCACTT 414 0 

TAATTATTTA TATTATTAAA TTAAAGCTTC TTGGGATTAA TACCCACTTG CTTGTAATTT 4 200 

AATCATGATT TGATTATACA CGAATATATA TTCTACCACA CTTCTATATT TGAGAGGAAG 4260

AACATGACAT TTTATTCCTT ATTAGAATAT TGTGAATCTG CTGTAAAATA ATCAACTACT 4 32 0 

TCTTAATATC AATATTTCAC TTTCATCTCA AAATGGTAAC ATTATAAATA ATTTATCTTT 4 380 


AACACCTTTT TAGAAAAGCA AGAAAAAACT AACCAATCTA TATAAAGACT GGTTAGCTTT 444 0 

TTAAATGATA ATTATTTAGC GATATAAGTT GTCAGCGTTC CAATATTATC AATAGTCACT 4 50 0 

TTAACTTCAT CACCTGGTTG TAAAAATTTA GGTGGTTGCA TACCTGCACC AACGCCTGCT 4560


GGTGTACCAG TTGCAATAAT ATCTCCCGGA TGTAGTGCAA CATATTTTGA AATTTCTTCT 4620

ATTAATTCAT CAATTTTAAG AATCATTTCG CCAGTGTTAC CATCTTGTCT AATTTCATTG 46 80 

TTAACTTTTG TAACAATATT TACATTTTCA GGTAATGGTA GTTCGTCTTT AGTAACGATA 4740

TAAGGACCCA TTGGGCAACC GCCAGTTAAA CTTTTTGATA AAAATGCTTG ATCTTGTTCA 4800

CTTTGTGCTT TGCGATCAGT GATATCGTTA ATAATTGTAT AGCCGTAAAC ATAATCTAAA 4860

GCTAATGCTT TTGGAATCTT TTCACCAGAC TTACCAATAA CAATACCTAA TTCACCTTCA 4920

TAATCTAATT GATCAGTAAT ATCTTTATGA TTTGGAATTG TTGCATTATC TCCTGTTAAA 4980

GATGACGCTG CTTTTGTAAA TACATATAAT TTTTCCACTT CATGATTTAA TTCGTTCGCA 504 0 

TGATCTTTGT AATTTCTACC AAAAGCAATC ACATTATTCG GAGGTGTTAC TGGTGGTAAA 5100

AATTCAATGT CATTAAATGA AATTTTATAG TCTTCAGCTT TGCCGCTATC TTCTGCTGCT 5160 

ACAACTGCTT TACGTACTTG TTCTTGAAAA TCTAAAGTAT GATTTTGTTG TAAACCAGCT 522 0 


AACAATGTTT TAGGATGGAA ATCTCCTTCT GCAAAGTCAG CAAATACTTG TGTTAAATCC 52 8 0 

CATACAGCAT CTTCGCGTTT TACTTTAACG CCATATGAAG TTTTGTCATT ATACTTGAAT 534 0 




TTCGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 54 6 0 

TGCTACAAGG ACTTTGCACT TAATCGAAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 5 52 0 

TATAGTTGCA ATGTACCAAA TTTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 5580 

ATCGTTTGAA GCGTATTTTG ATAATATTTC ATCTGTATCT TATATTTATT TTTTAATTGT 564 0 

GTACCAATTT CTTCATCTGT CATCCCACGG CGACGATTAA ATGCATCGGT TTTATAGTCT 57 00 

ACAAAATAAT GCACACCATC TTTAACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 5760 
ACG 5763 
(2) INFORMATION FOR SEQ ID NO: 506:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506:

CCATATGATT TTGTGCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG 6 0 

CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC AGTGTATTTC 120

TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 180

GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 240

GATTATCTAA XCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC 3 00 

CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CATCTCAGAA AGTGCAGTAG 3 60 

TTGTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 420

AT 422 
(2) INFORMATION FOR SEQ ID NO: 507:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 
GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 60

ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 120 
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10 






TTATCTTGGC AAGAAAGCAG AGACAAACCG CATTACTGGC ATTTCTAGTA AACCACCTAT 24 0 

TTTACTAACA CCATTATTTC CAACTTATTT TTTCCCAACA CATTCTGACA GACAAAATGA 3 00 

AAATATTTGG TTAAATATGC ATTATATCGA AAGTATTAAA GAATTAAAAA ATCGTAAATG 3 60 

TAAAGTGACA TTTATTAATA ATGAATCAAT CATTCTTCAT GTTTCATACC ACAGTTTATG 4 20 

GCATCAATAT AACAATTCCA TTTTTTACTA TTACATGGTA GATAAACAAT CTCGCATGAT 4 80 

ATCAAAAAAT CCCGACCAAC CAATAGATTA TAATAAAGCC ACATTGAATG TGTTTGAAGC 54 0 

ATTGACACGC TATTCTTTAT TTGAAGATAA ATAAATTGTT TATTTTTAAA ATATGCGGAA 6 00 

TGTTTTATAA ATATAGTGTA AATGTTCTGC ATATTTTTTT AAGGTATCTA TTGCAAATTA 660 

ACTTAATCTT GTTATAATAA TATTTGTGCT TGaTATTCAA ACACATACAA ATTAATCCAC 72 0 

AGTAGCTCAG TGGTAGAGCT ATCGGCTGTT AACCGATCGG TCGTAGGTTC GAGTCCTACC 780 

TGTGGAGCCA TTGGAAACGT ACTCAAGTTG GCTGAAGAGG CGCCCCTGCT AAGGtGTAGG 840

TCGCGAAAgG CGCGAGGGTT CGAATCCCTC CGTTTCCGTT ACTTGCTAAA ATGGTATATA 900 

CCATTTTAnC TTTTTTGTTT ACTTATATAT AATGAATGAG AATTTCACTG TTCTTTTATA 960

TCAATTTTAA AA1TCTAAAA ACCTTTCCTA GATAATCTTC TCTAAGAAAG GTTTTTATAC 1020

TTGTTGAACT TATAATTAAT TTATTACATA GCAATATTTA CCTGTTTTTA ACTATAAAAT 10 80 

TATCACTACA TGAAATACGA TAATTCGGaT CTCTTAACTT CTCTGCaATT AATGtACTCA 114 0 

TTGgTTTCAT CGTATGATTC ATGTATAATA GCATTTkTTA AATAATTC 1188 
(2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508:

CCCAACTTCG GTTATAAGAT CCCTCAAAGA TGATGAGGTT AATAGGTTCG AGGTGGAAGC 6 0 

ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT 12 0 

TTGCGAACAA AAtCACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 180

GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 24 0 

CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAACGTTG CCAGGCATAA 300 

TATTAATCCA CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT 360 
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GGCGGTAACA CGGGTTCGAG TCCCGTAGgA GTCATACAAG CAGAAGTGAA ATATCGCTTC 4 60 

TGTTTTTTTA TTACATATTT ATTGTTGAGG AAGGTTGTCC GAGCTGGCCG AAGaGCACGC 54 0 


CTGGaAAGTG TGTAGGCGTC ACAAGCGTCT CAAGGGTTCG AATCCCTTAC CTTCCGTAAA 600 

GGcGCTTAAA TTGGTTTTAC CCATTTTAAG CGCTATTTTT ATTTTGGACT CAATCCCTTG 660 

ATATATCTGC ATTTGAGCTA TTATCCTCAT TTTTACACTT CTTATTTATT TATATCCATT 720 


TAAAATTTTT TAGCCACAAT GTGACTAATT TTTGaTGAAT AATCCTAATT TTAGtCACAA 78 0 

GATTTTGAAG TTTAGTCACA AAACAAATCA TTCAGATTTT TTTCyATAAA TTTAGTTTCA 840 
(2) INFORMATION FOR SEQ ID NO: 509:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

TTCTATGAAA CTATCAGCTG TTTCTTTTTC CTTGGTAAAA ATCGTTTTAA CGCCATGATA 60

CATGTATTCT CTTTCAaCAC CCGATTGGTC ATAAGCATCG AAAACACAAA TAATTTCATC 120 

TGAAATAACT GCATTATAAT TTGCAATTGC ATCTATTAAT TGCATTCTAG CTTCTTCTAA 180 


ATTCTCTTTT GCAATGGCGC TTAGCGTTGG TGATTGTCCT ATCATATTGT ATCCATCAAT 240

GATTAAGTAA CGTTCTTTCA TTATATTTCT CCAACATCAT GTCTTTTTCG AAATACTTCG 300 

TACATCATTA AACTTGCTGC AACCGAAGCA TTCAAACTGT TTACATGTCC AACCATTGGA 3 60 


ATCTTAATAT AAAAATCGCA TTTATCACTT ACTAGGCGAC TCATACCCTG TCCTTCGCTA 420 

CCAATTACAA TAGCCAATGA CATGTCCGCT TCTAGATTTC TATAATCTGT TGCATTATTA 480 

GCTTCAGTGC CAGCTACCCA AAAGCCATTA TCTTTTAGTT CATCGATAGT TTTAGCTAAA 540

TTTGTCACTC GAATAACTGG TACATGTTCA ATTGCACCTG TTGAGGCTTT TGCAACTGTT 600 

TGCGTTAGTG TAACTGAACG ACGTTTAGGA ATAATAACAC CATCAACTCC CGTTGCATCG 660 

GCTGTTCTTA AAATTGATCC CAAGTTATGT GGGTCTTCTA AGCCGTCTAA TATAAGTACT 720

GTCAATAAAC CTTCTTTTTC TTTTTGCTGT TTTAAAAATT GATCGAAGTC AGCATATTCA 78 0 

TATGGTGCAA TAAGCGCTGC AACACCCTGA TGTGGTGCAT TTGCTAAAAA ATCTAATTTA 84 0 


GATTTTGGTA CAGTTTGAAC AATGATTTTT TGATCTTTTG CATTTTTTAA AATTTCATTA 90 0 

ATTTGTTGCT TTTTAATACC TTCTTGAATC AATATCTTAT TTAyCGGATG CCCAGTAATA 96 0 

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TTCGTTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATCGTTCTT CTCTTTTTTC 1080 

TAAATAaAGA AAACCTATCA CTGCTTCTAT CkCTGAACTT TTACGATATG TTTGAACATC 114 0 


AGTGTTTTTA GCTTTAGTAT GACTTTTCGC GTTACGCCCT CGCTTCAAAA TATCCATTTC 1200 

TTCGTCTGTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCGCCT GACTTTTGGC 1260 

AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 1320 


GATATAGGTA CGTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 1373 
(2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 
TCAAGTGCCA ACTACACAAC TGATTTACAT TCTTTAGGTC AATATGTACA AGAAGGCCGT 6 0 

CGCTTCTTAT TCGAAACAGT TGTAAAAGTA AATCATCCTA AATATGATAT TACTATTGAA 120

AAAGATAGTG ATGATCTAGA CGGATTAAAT TATTTAGCTG GTAAAACAAT CGACGAAgTT 180 

AACACAAAAG CATTCGAAGG TACATTATTA GCGCATACTG ATGGTGGTGT TCCTAACATG 24 0 


GTAGTGAACA TTCCACAATT AGATGAAGAA ACTTTCGGTT ACGTCGTATA CTTCTTCGAA 3 00 

CTTGCTTGTG CAATGAGTGG ATACCAATTA GGCGTAAATC CATTTAACCA ACCTGGTGTA 36 0 

GAAGCATATA AACAAAACAT GTTCGCATTA tTAGGTAAAC CTGGTTTTGA AGaCTTGAAA 42 0 


AAAGrATTAG rAGAACGTTT ATAAAATACA TTACTTCAAA GATTAGTGAA GTTTGAAAAG 4 80 

ATAGAACTAG ACGTTAACTA TTTAAAGCAT ATTTTCGAGG TTGTCATTAC AAATGTAAAA 54 0 

ATGTAATGAC AACCTCGTTT TTATTTATAT GCAAGAACTA GGTTACTAGC TAaTGTGACA 600

AGATGTTAAG AGAAAATTAA aGATAAAATA ATATCTGCCT TACAATAATA TTGTTATACT 660 

ACTAGnGACT GATTTATTAG CATGATTACA TGTTAATGTT TCTTTACTTA GTAATTA 717 

(2) INFORMATION FOR SEQ ID NO: 511:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



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AATCTAATTT TTCCTCCTAA TCTCGGATTG GATAGTGGTC AATTTTTCTG GCCCGCCATT 6 0 

TTAGCGTTTG TTCTAACTGG GATTGGTTTA CCATTATTAG GTGTGATTGT AGGTGCACTT 120 

GATAAAGAAG GATATATTGG CGCATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 180 

TTAATCATCA TTTATTTGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 24 0 

TCTTTTGAAA TGACAATTAC ACCAATTATA CATAGCAATA GTAGTATCGC TTTATTTATA 3 00 

TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 360 

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 420

AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 48 0 

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 54 0 

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 6 00 

ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660 

TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 72 0 

GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 78 0 

ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGTCTA 84 0 

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TCCATAGAAT CGTACCTAAA 90 0 

GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 96 0 

GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 102 0 

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC CGACAAAACG CATTTCACAA 1080

CAAATTCCAG TTATTATCGT ATTTATATTG TCGATTTTCA GTGTTATTAG TAAGTTAGGT 114 0 

TGGcTGAAAA TTAACTTTAT AGAATCATTG CCTCTAAGAG CGTATTCTTT AGAGTGGkTC 1200 

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTCG GCATATTTGT AAAACAAGAT 12 6 0 

CCAATTAAAT ATCAACAGGA ATAACGAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 132 0 

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 1380 

ACAATACTTC GTATTGAATG GCTTCGCTTT CCTAGGGTGC CGTCTCAGCC TCGGTCTTCG 144 0 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 1500 

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 156 0 

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 1620 

ATTAACGAGG TGCCTTATGT ATAAAAATTA TAACATGACC CAACTTACAC TACCAATAGA 168 0 

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 174 0 
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AAGATATGCA TACCGTAATG ATAGATATAG TTTTAAACGT GACTTCAAGC TATATGAATG 1860 
TGATGACTGT TCATCATGTT CTTTGAGACA TCAATGCATG AAGCCAAATT CGAAATCCAA 192 0 


TAAGAAAATT ATGAAGAATT ATAATTGGGA ATACTTTAAA GyCCAAATTA ATCAAAAGCT 1980 
TTCTGAACCA GAAACGAAAA AAATCTATAG TCAAAGAAAA ATTGATGTAG AGCCTGTTTT 2 04 0 

TGGATTTATG AAGGCTATTT TGGGTTTCAC TCGAATGTCA GTTCGAGGAA TAAATAAAGT 2100 


TAAACGAGAG CTAGGTTTTG TATTAATGGC ACTTAATATA AGGAAAATAG CAGCTCAACG 216 0 

AGCTGTACAT TATAAAATAC ATATCAAAAA AGCTGATTTC TATCAAATAA TTAATAGAAA 2 220 

TCAGCTTTTT TACATTGCCT AAGAATTTAA TGTCCCAAGC CCTTTTTATC GAATAACTTA 2280

TTGTAAACCT TGTCTTTCTT GGTTATTGTT TTCGTTATTT TTTTCGTGTT TTTGTTTCCA 234 0 

CTCTTTTTGA GTCATTACAT CGTCAACTTG CATGTTAACT TCAACTACAT CTAAACCAGT 24 00 

AATATATTTA ACTTGTTCTT TAACTAATTC AGTTACTTTA CGGAAGATTT TTGGTGCTGA 2460

TTCACCATAT TCTAAGATTA CTTTTAAGTC TACAGCAGCT TGTTTTTCAC CAACTTCAAC 2520 
AGATACACCT TGAGTAACAT TGTTGCCACT TGAGAATGCA TTAGTGAATG TATCAGTTAA 2 580 


GCCACCTTTC ATATCTAAGA TACCTTTAAC TTCACGTGCA GCGATACCAG CAATTTTTTC 264 0 

AACAACTTCA TCAGAGAATG TTAATTTGnT TTTGAATTGA GGCTCCTGAT TTnGTTCnTG 2700 
(2) INFORMATION FOR SEQ ID NO: 512:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 
ATCAAATACA ATAATTAAAA TAaTGGCTAA TACaCCTAaA ATaATAATcG AGTTaGAGTA 60 

GCCTAAGCGA yCACCcTAAC aGTArACATT CTAGGCATAT ATTTaTCTTT AGACATTGAA 120

GCCGCTAACA TCGGAAAAGC TGTAAATCCA GTGTTGGCTG CTAATACCAA AATCATCACG 180 

GTCGTTGCCT GTACAAAATA GAAGGCAGCA TTATCACCAA ATATTTGCAT AGCTAATTGT 24 0 

GATAAAACTG TCGTTTCCGT TTGTGGcAAA ATTCCATAAA CATATGCTAA ACCAACGATA 3 00 

CCAACTAATA AAAACGCTAA AATTGAACCC ATAGCAATTA ACGTTTTCAC AGCATTTTTA 3 60 

GCACTTGGCT CTCTAAAATT GGTTACCGCA TTTGAAATAG CTTCAACACC CGTTAACGAT 4 20 

GAAGCCCCTG ATGAAAATGC TTTTAATAsC AAGAATAATG TTACTCCAGG AACCGCAGTT 4 80 
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AATATTAATA TCACTAACCC TATAATGAAA AGATATACTG GATAGGATAA TACGGTGGCA 6 00 

GATTCAGTTA AACCACGTAA ATTTAATATT AAAATAAAAA GTACAAGTAw ACATGCAATC 6 60 


AGTrCJtTTAT GCCCATATAA ACTTGGGAAT GCAGCAACAA ATGCATCAGC ACCAGATGAT 72 0 

ATmCTAACAG CGACAGTCAG TATGTAATCG ACTAATAATG AGCCTCCTGC AAGCAATCCC 7 80 

CATTTTTCTC CXAAAXXGGT CTTGGACACC ATATACGCGC CGCCACCTTT AGGATATGCA 84 0 


TAAATAATTT GCCTATAAGA CATAATTAAA GCAGCTAATA AGATCAGAAC AGCACCTGCA 9 00 

ATCGGTAAAG TATACCAAGT TGCAACTGCA CCCACTACTG ATAGTGTAAT CAGTATTTGT 96 0 

TCGGGACCGT AAGCCACTGA AGATAATGCA TCCGACGAAA GAATCGCTAA CCCTTTAAAC 1020

TTCGATATCT TTTCGTCTTT TAGTTCTCTG TTTTTTTTAG GTTGCCCTAT AATAAGtCTT 108 0 

TTAAATTGAT JcGAcATAAtC TCCTATTCCT TTTTATAGTT TTnGATGGAA ATCAC 113 5 

(2) INFORMATION FOR SEQ ID NO: 513:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 


TTATGGTAAA ACTTTAACAC AAGCATTACA TAAATCGCAA TCATTTATGT GGCAAAAACG 60 

TTGAATTTAA GCCATTTCTA TTCATTTTAC GTAATTATTA GCCGTATATG TGTAATAATA 12 0 

CACATTTTAT TCAGATTTTT TATCGCGCTC CATTAAATCT TTTACGCATT CTTTTACTGA 180 

GATATTTTCA AATAATACTC TATATAATGC ATTTGTAATT GGCATATCCA CATTTTTTTC 240

TTTAGCTAAA TGATAAACTG ATTTAGTTGT ATAAATACCT TCAACAACCA TATTCATTTC 3 00 

AGATAATGCT TGATCCATTG ATTCACCTTG TCCAAGTTTA TATCCTAATG TGAAATTCCG 360

AGAATGTGTT GATGTGCAAG TAACGATTAA GTCACCGATA CCACCTAAAC CTAGAAATGT 4 20 

CATAGGATCG GCACCTAACT TTTCACCTAA TCTACTAATT TCCGCTAAGC CACGAGTCAT 4 80 

TAATGCAGCT TTTGCATTAT CACCGTAGCC AATTCCAGCT ACGATACCAC TTGCTACTGC 540

GATGATATTT TTCAATGCAC CACCAAGTTC AACACCAATC AAGTCATCAT TCGTGTACAC 600

ACGCAAATAA TCATTCATAA ATAAATCTTG CGTTAATTTA CTTACACTTT TATCTTTTGA 660 


TGAAGCAGCA ACTGTAGTTG GTTGCTTGAC TACAACTTCT TCCGCATGAC TTGGCCCTGA 72 0 

CAACACGCCA ATACCTGCAT TATATTCAGG TGAAATAGAA TCTTCAATCA TTTCTGACAC 7 80 
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CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 900 
GTAAATATCT GCAAATTG 918 


(2) INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514:

CAATACTCGG TATTATAGAT AGTCCTACTA CATTTATATG GGTTTGTATC ATTACATTAA 6 0 

TTGCACATCA ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 12 0 

TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GGCTTTACAT 180

TAGTTCTGAT TGCAGTGCCA TTATATGCTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 24 0 

AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 300 

GTCTTTTAAG AATATAAGAT ACTATCGCAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 360

TTTTTTTGTG CATAAAAAAT CAGTCmAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 42 0 

TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 4 80 


GATATTATTT AATTGCGTGG AATCCGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 54 0 

TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 587
(2) INFORMATION FOR SEQ ID NO: 515: 


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 812 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 60

TTAAACCAAA TAAAAATAAT GTTTTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 12 0 

TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 180 


TATTGTGTGC TCTCGCTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 240

TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA TAAAwAcCAA 3 00 
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TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 4 20 

TTATGATGAC AATGCTTTTC TTTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 4 80 


GCGTTATAAT GAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 54 0 

CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TTCTAATGTA TATGAAAGTG TATCAATATA 6 00 

AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 660 


CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 720 

GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 7 80 

GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 812

(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 526 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 

GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA TGATTGATAT AAGTTGATAA 60 

TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 120 


CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATTAGCATTA ACATTACCAA 180

TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 24 0 

TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA TCTTTTATTA AATGACTTGT 3 00 


TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 360 

AAATTTATTA GATTTAAAAT AGAACACTTT GTCATAGATT TGAAATGACA AATGTCATTA 420

TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 480

ATAATCGrGc TCCTTACAAA TAaATTATAA AAttGGCGAA CTAAAT 526 
(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 
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GGAACACCTA AAGAAAGAAA TGATGCATTA AACACAGAGG CTGATATCTA TGTAACCAAT 12 0 

AAAGAAAATA CTAAATGGTT ATGCGATCAA TATAAAAAAG AATGGCCATT TGATATGGTT 180 


GTAATTGATG AACTGTCTAC ATTTAAAAGT CCTAAGAGTC AAAGGTTTAA ATCTATTAAA 24 0 

AAGAAATTAC CACTCATTAA TAGATTTATA GGATTAACAG GAACACCTAG CCCAAATAGT 3 00 

TTACAGGATT TATGGGCTCA AGTTTATTTG ATAGACAGAG GCGAAAGACT TGAGTCTTCA 36 0 


TTCAGTCGTT ATCGAGAAAG GTACTTTAAA CCAACACATC AAGTTAGCGA ACATGTTTTT 420 

AACTGGGAGC TAAGAGACGG ATCTGAAGAA AAGATATATG AACGAATAGA AGATATATGT 4 80 

TTAAGCATGA AAGCGAAAGA TTATCTAGAT ATGCCTGACA GAGTTGATAC TAAACAAACA 540

GTAGTCTTAT CTGAAAAAGA AAGAAAAGTA TATGCAGAAT TAGAAAAAAA CTATATTTTA 60 0 

GAATCGGAAG AAGAAGGAAC AGTTGTAGCT CAGAATGGGG CATCATTAAG TCAAAAACTA 66 0 

20 CTTCAACTAT CTAACGGTGC AG ITT ATA CA GATGATGAAG ATGTAAGACT TATACATGAT 72 0 

AAGAAGTTAG ATAAGTTAGA GGAAATTATA GAGGAGTCTC AAGGCCAACC AATATTATTG 78 0 

TTTTATAACT TCaAACATGA TAAAGAAAGA ATACTTCaAA GGTTTAAGGa AGCAACCACA 84 0 

TTaGAGGATT CAAACTATAA AGAACGTTGG AATAGTGGAG ACATTAAGCT GCTTATAGCA 900

CATCCAGCAA GTGCAGGGCA TGGATTAAAC TTACAACAAG GTGGGCACAT TATTGTTTGG 960

TTTGGACTTA CATGGTCATT GGAATTATAC CAACAAGCAA ATGCAAGATT ATATAGACAA 1020 


GGACAAAATC ATACGACTAT TATTCATCAC ATTATGACCG ATAACACAAT AGATCAAAGA 1080

GTATATAAAG CTTTACAAAA TAAAGAACTA ACGCAAGAAG AATTGATGAA AGCTATTAAA 114 0 

GCAAGAATAG CTAAGCATAA GTAATGGAGG TATAAGATGG GAAAGGCATC ATACGATATT 1200

AAGCCAGGTA CATTTAAATA TATTGAGTCA GAGATATATA ACCTACAAGA GAACAAGAAA 12 60 

GAGATAAATA GATTGAGAAT GGAGATACTT AACCCAACGA AAGAGCTAGA CACTAACATT 1320

GTGTATGGAC CGTTGCAAAA AGGTGAACCA GTTAGAACAA CTGAACTAAT GGCAACAAGG 1380

TTATTGACTA ATAAGATGTT ACGAAACCTA GAAGAAATGG TCGAAGCAGT TGAAAGTGAA 1440 

TACTTAAAGT TACCTGAAGA TCATAAGAAA GTAATTAGGC TAAAGTATTG GAATAGAGAT 1500 

45 AAGAAGCTAA AG AT AG AG CA AATAGGAGAT GOATGTCACA TG CAT CGTAA TACAGTTACT 1560 

ACTATACGAA AGAACTTTGT TAAAGCGGTA cGvt ATCATG CAGGTATCAA ATAACATTGT 162 0 

GCAAAGATTG TGCAAAAGGC CTACAAATCT GTAGTAATAT GATAGTATCG GAAAGATGTA 16 8 0 


TAAAGTTATC TAAAAGTTAT ACGACACAAG TACACGAGGC ACATCGCTAT GCGtGTGTCT 174 0 

TTTGTTATGC AATCAAAGAG GTGTAAGAGA TGACCAAGCA TAATAACATT TATAAGCATG 1800
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AGAT AG CAT T AGATAGGGAT AATCATCTTT GTCAAATG TG TTTACGTGAA GACATAGTAA 192 0 

CAGATGCAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 198 0 

ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATGATA 2040

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 2100 

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGCC 216 0 


CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAACAAAA 222 0 

AAACAGCTGA AAGAATATAT AGAGGATTAT AAAAAATCTG ATGACATATT AATTAATTTG 228 0 

TATATAGAAA CGTATGAATT TTATTGTCGG TTAAGAGATG AACTTAAAAA TAGTGATTTG 234 0 

ATGATAGAGC ATACAAACAA GGCTGGTGCG AGCAATATTG TTAAGAATCC ATTAAGCATA 24 0 0 

GAACTGACAA AAACAGTTCA AACACTAAAT AACTTACTCA AGTCTATGGG TTTAACTGCA 24 6 0 

GCACAAAGAA AAaAGATAGT TCAAGAAGAA GGTGGATTCG GTGACTATTA AAGTTTTAAA 2520

TGAACCTTCA CCAAAACTAT TAACAACATG GTATGCAGAG CAAGTCACTC AAGGGAAAAT 258 0 

AAAAACAAGC AAATATGTTA AAAAAGAATG TGAGAGACAC CTTAGATATC TAGAAAATGG 2 64 0 

AGGTAAATGG GTATTTGATG AAGAATTAGC GCACCGTCCT ATTCGATTCA TAGAAAAGTT 2700

TTGTAAACCT TCCAAAGGAT CTAAACGTCA ACTTG TATTA CAACCATGGC AACATTTTAT 276 0 

TATTGGCAGT TTGTTTGGTT GGGTT CAT AA AGAAACAAAA CTGCGCAGGT TTAmAGAAGC 2 82 0 

TTTGATATTT ATGGGGCGAA AAAATGGTAA AACAACTACT ATATCTGGTG TTGCTAACTA 2 8 80 

TGCTGTTTCT CAAGATGGAG AAAACGGCGC TGAAATCCAT CTTTTAGCAA ACGTAATGAA 2 94 0 

ACAAGCTAGG ATTCTATTTG ATGAATCTAA GGCGATGATT AAAGCTAGCC CAAAGCTTAG 3000 

AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3060 

GCCACAGGCT TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 312 0 

TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGCGGC 318 0 

AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 324 0 

TGTTAATATG GTAGAAGCGG GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 330 0 

TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3360

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 34 2 0 

AAAGGCTAAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 34 80 

TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 3 54 0 

TATTTCCTTA GATGAGTTGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3 6 00 



AACACATTCT TGGATTCCTA AGCATAAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 3720

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 3 780 

TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3 84 0 

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3900 

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 3 96 0 

AGATGGGAAA ATAATATTTA ATAATAATCC TTTAATGAAA TGGTATATCA ATAATGTTCA 4 02 0 

GCTGAAACTA GACAGAAATG GGAACTGGCT GCCATCTAAA CAAAGCAGAT ATCGTAAAAT 4 080 

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 4140

CAAGGGTGAA GGAAACATAG mATTTATTAG TATTArAGAT ATAATGCGTT AAGGAGGTGA 4200 

ATGTTATCGC AAAAGAGAAT ATTGTCACAC GCATAAAGAA AAAATTGATA GACAATTGGA 4 26 0 

TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 4320

GTGTAATCAA TAATACGCTT GAAACTAATG AAACGATATT TTCAGCTATT ACnAAGTTAT 4380

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTATGAAGA TTATAAAGTA GTTAATACAG 4 44 0 

AAGTATCTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4500

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATGCATA TGTG 4 544 
(2) INFORMATION FOR SEQ ID NO: 518: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 60

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 12 0 

TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180 

TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTCCTTTG 240

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 300 

AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 360 

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCAAGGAT 4 20 

AATAAAGATT TAATCGTGGT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 4 80 
ACATTTTTTA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 600

TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAATAGTA AGTCAGATAT ATTGTTTATT 6 60 

ATTGAAGGTG TATTGaTGTA TTTTGATGAG AGTGTAATGA CTCAATTATT GGACACTATT 720 

ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 780

GCGAATAATA CmAAGAGACA TCAATCGGTA TCCAAGTTAT CCTCACCACC TGTTTTTAAA 84 0 

TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C 881 
(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519:

TGAATaaAAA TATATTAATA GATAAACACA AATGTGTCcA AATACCCCTA GAGGTATTTG 6 0 

ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGGGAG GTTATTATGA 120

AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC AGAATTGGAA AAACTAGCCA 18 0 

AGCAAGGGCA ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA CATATCAATG 240 

GTTC CAT ACT TCATCCTGTT GATGAGATTG AGTCATTCAA TAAAGAAAAA AATAAAACCT 3 00 

ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAGCTAAAC 360 

AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATGAAGAA GAAAACGATA 42 0 

GTTATGATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGCAGATCGT AAACAATTTA 4 80 

ACTATCGTGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 54 0 

TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTGACATTA 600

AAAGTTGGGT GAAACAAACA AGG CATACTT TAGTTAAGCT TGATGAAAAT AACAATGGAA 66 0 

TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAGATTTAGA TATAAATTAT TCTGCTAAAG 72 0 

GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 780

CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGGGCTTA 84 0 

ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 900 

ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATGTTTGGTT 96 0 

TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 102 0 
TCATGGGTAT TCAGAAAGAA GAACTTAGAG ATGAAGTTGA GTACGGTGGT GTAGGCACTT 1140

ATATTGGTGC TACTGAAAAT G CG AAT CAT A ATTTATTTAT CTAATTAAAT CTATTAATAA 1200 

AAGGAGTTGT TATCATGTTT TTTAAACAGT TTTACGATAA TCATTTATCT CAAGCATCAT 1260 

ATTTAGTGGG TTGTCAACGT ACAGGAGAGG CAATAATAAT AGACCCTGTT CGTGATTTAT 132 0 

CGAAATATAT AGAAGTTGCA GATTCTGAAG GTTTAACAAT TACACAAGCT ACAGAAACAC 1380 

ATATTCATGC TGATTTTGCT TCAGGAATTC GTGATGTGGC TAAACGCTTA AATGCAAATA 144 0 

TATATGTGTC TGGCGAAGGT GAAGATGCAT TAGGGTATAA AAATATGCCA TCAAAAACAC 1500 

AATTTGTTAA ACATGGAGAT ATCATTCAAG TAGGCAATGT TAAATTAGAA GTTCTGCATA 1560

CTCCAGGACA CACGCCTGAA AGTATTAGCT TTTTACTCAC TGATTTAGGT GGTGGTTCAA 1620 

GTGTTCCGAT GGGATTATTT AGTGGTGACT TTATTTTTGT TGGTGATATA GGTAGACCTG 16 80 

ATTTAtTAGA AAAATCTGTT CaAATAAAgG GTTCTACAGA AATTAGCGCG AAACAAATGT 1740

ATGAGTCCGT TCAAAATATT AAAAATTTAC CAGACTATGT TCAAATCTGG CCGGGTCATG 1800 

GTGCTGGAAG CCCTTGTGGT AAAGCATTAG GTGCCATACC TATATCTACA ATAGGTTATG 1860 

AGAAAATTAA TAACTGGGCA TTTAATGAAA TTGATGAGAC TAAATTTATT GAATCATTAA 192 0 

CATCAAATCA ACCAGCACCA CCGCATCATT TTGCACAAAT GAAACAAGTT AATCAGTTTG 198 0 

GTATGAATTT ATATCAATCA TATGATGTTT ATCCTAGTTT AG A T AATAAG AG AGTAG CAT 2 04 0 

TTGATCTTCG TAGCAAAGAG G CCTTTC A CG GTGGCCACAC AAAAGGAACA AT C AATAT AC 2100 

CATACAACAA AAACTTTATT AATCAAATTG GTTGGTACTT AGATTTTGAA AAAGATATAG 2160 

ATGTAATTGG AGATAAATCT ACTGTTGAGA AAGCGAAACA CACTTTACAA TTAATTGGGT 2220

TTGATAAGGT AGCAGGCTAT CGTTTGCCAA AATCAGG CAT TTCAACCCAG TCCGTTCATA 22 80 

GCGCTGATAT GACAGGTAAA GAAGAACATG TATTAGACGT ACGTAATGAT GAAGAGTGGA 234 0 

ATAATGGACA CTTAGATCAA GCAGTTAATA TTCCGCATGG TAAATTATTA AATGAAAATA 2400

TTCCTTTTAA TAAAGAGGAT AAAATATATG TACATTGTCA GTCAGGTGTT AGAAGTTCAA 24 6 0 

TTGCAGTGGG TATATTGGAA AGCAAAGGTT TTGAAAATGT GGTGAATATT AGAGAAGGCT 2 52 0 

ATCAAGATTT TCCAGAATCA TTAAAATAAT TTAAGGATGT GGAAAAAATG AATAAGCATT 2580

ATCAAATTGT TATTATTGGT GGCGGTACAG CAGGTGTTAC CGTAGCATCA AGACTATTAA 2 64 0 

GAAAAAATCA AAACTTAAAA GAGAAAATAG CAATTATAGA TCCAGCAGAC CATCATTACT 2700 

AT CAAC CAT T ATGGACGTTG GTTGGTGCAG GGGTATCTAG TTTGAAAAGT TCTCGTAAAG 276 0 

ATATGGAAAG TGTTATACCT GAAGGTGCTA ACTGGATAAA ACAGGCTGTT TCAAGTTTTC 2 8 20 
TAGTAGCTCC AGGATTACAG ATTAATTGGT CTTCAATTAA AGGACTAAAA GAAAATATAG 2940

GTAAAAATGG TGTTTGCTCT AACTATTCAC CTGACTATGT TAACgAAACT TGGAACCAAA 3 000 

TTTCTAATTT TAAACAAGGA AATGCCATTT TTACGCATCC AAACACTCCT AT AAA G TG TG 3 06 0 

GAGGTGCgCC TATGAAAATT ATGTATTTAG CTGAAGATTA TTTTAGGAAA CATAAAATCC 3120 

GT 3122 



(2) INFORMATION FOR SEQ ID NO: 520: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520:

ATAnAGATAT AnATnAATAT ATTGAGGTCA AACGATGATA ATTAAAATTT TAACAATTCT 6 0 

ATTACTACTT TGTATATTGA GCTATTTGGT TACAAATAGA AAGAAGCCTT TTCTGTTCTT 12 0 

AAAGACACTC TTTATGGGTG TGGTATTTAT CTTTATAGGA TATATTTCAC TGGCAATATC 180

TGCCGTAATT ATTTATGGTA TTATTCAATT TATCACAATT GATTTTGGTA GTTTTTTCTT 24 0 

AATGGGTATT ATATTGATCT TGATTTCAAG TATATTC CAA TTATTTATAG TTAGATTACT 3 00 

TTTTAGAAAA AAGAATGTCG ATTTGACAGA GGTTGTCGTT TTAsAsCATT TAATTCAATG 360 

GTTCTTAGTT TACTTTGCGA TCTATCAAGC AGTAAATGAA AAAATGGACA TTAATGATAT 4 20 

TAATATCGAC AATTTCCAAT CTGTCTTTTT TGACGTGTCT AATTTGAATT TAGTAATTCT 4 80 

ACCAACGTTA ATCATTAGCT GGGTCACAAT ATTTAACTAT AGAATGAGAA GTTACAAATA 54 0 

AAATCTATGA GATTATACCT TCAGACACCA ACATTCAAAT GGTGTCTTTT TTGTTGTGTG 600 

GTTTTATTTT TGAAATTCGA AAAAGTAGAG GCATGAATTT TTTGACTAGT GTATAAGTGC 660

TGATGAGTCA CAAGATAGAT AGCTATATTT TGTCTATATT ATAAAGTGTT TATAGTTAAT 720 

TAATAATTAG TTAATTTCAA AAGTTGTATA AATAGGATAA CTTAATAAAT GTAAGATAAT 780 

AATTTGGAGG ATAATTAACA TGAAAAATAA ATTGATAGCA AAATCTTTAT TAACATTAGC 840

GGCAATAGGT ATTACTACAA CTA CAATTG C GTCAACAGCA GATGCGAGCG AAGGATACGG 900 

TCCAAGAGAA AAGAAACCAG TGAGTATTAA TCACAATATC GTAGAGTACA ATGATGGTAC 960 

TTTTAAATAT CAATCTAGAC CAAAATTTAA CTCAACACCT AAATATATTA AATTCAAACA 1020 

TGACTATAAT ATTTTAGAAT TTAACGATGG TACATTCGAA TATGGTGCAC GTCCACAATT 10 80 
TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 1200

AAAGGCAGTC AACTTAGTTT CGTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 126 0 

AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATCGTT GCTGTTATCT 1320

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 1380 

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 144 0 

AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 1500 

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAGCAGAT GCAAGTAGTC TGGACAAATA 1560 

TTTAACTGAa aGTCAGTTTC ATGATAAACG CATAGCAGAA GAATTAAGAA CTTTACTTAA 162 0 

CAAATCGAAT GTATATGCAT TAGCTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 168 0 

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 174 0 

TGATGCTAAA GTTGCATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 1800

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAACCTG TTTTCAATTT GCATATGTGA 186 0 

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGAGACAAA 192 0 

TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 2 04 0 

TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 210 0 

AAGTTTTTTC AATACTCGTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 2160

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 222 0 

TGCAAGTTCG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 22 8 0 

AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATGCAAGT 2 34 0 

TGGCGGGGGC CCCAACATAG AAGCTGGCGA AAAGTCAGCT TACAAAAATG TGCAAGTTGG 24 0 0 

CGGGGCCCCA ACATAGAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 24 6 0 

GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 252 0 

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAGCTA ATGATAAAGA ACCACTACAT 2 58 0 

AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 2640

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 270 0 

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT TTTCTTTGTT GAACCTTTTT 276 0 

ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2820

TTATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT AAAGATATAA 2 88 0 

AATTTGATGA TGATATTTGC TTTTTATTTT CCAAATGGAA TTTACTTAAA CTGATGCATT 3000

AAAATATTAA TGAAGCACTA GAATACATAA ATGAATAGTa ATGGTGcACA GTATAGAATA 3 06 0 

ATTAAGGCTA TATTAAGTAT AAATATCGTT AACTGTAAGC TAT CTTTAGT TTTAATATAA 312 0 

ACTATTAGGA TAATCGACGT AAGAAGAATC ATATATATTA ATGATGAAGA AG T C CAT A C A 318 0 

AAATCCGCAT CATTTG7TGT TAATAATGGG ACTATAATTA ATCCGAAATT AATCATG CAT 3 24 0 

GCTATATATA CTATAATGTT ATACACAATG TTAATTTTTG TTCACCACCT TATACTTCTA 3300 

TTTTAAAAAC TTCTTTATAA TGATATATTG TTTAATGTTG AAATAATTAG ATTATCTAAT 33 6 0 

TTTCATTTGC TTTACATGTA AAAGG CT AT A TATAGTATGC TCTTTATGAT TCTAAATGCT 34 2 0 

TTTTAATATT TAATGCTCAT CAACATTTGG ATTTTGAATA TTCAATTCAA AAACTTTATT 34 8 0 

AGCTACGTCA ATTGTAAAAT CAGAACCATA GTTGACATGA GCTACTTTTA ATTTTCCATC 354 0 

TAAATAATAG ATTGCGATTG CAACATCGTA AAATTCGTCA ATGACAAATA AACTCTTTTC 3 6 00 

GTTTGTTACA ACCTCATGCT CTCCTGAGTA TACAACGTTA ATTTCCCAAT CATTAAAAAC 3 66 0 

CATTTGTTAA CCTCCTTGAA CATTTAAATT GATT CAACTT AAGTTTAACT TATTCATACA 3720 

ACTTCGTACA ATATCTAGAT GAACATTAAT TGTATTTCTA GAAATCTTTT TCAATTATAT 3780

GTACTAATTA TACTTTTAAA TTTCTTATTT CAGTATAGTT TTAAAACGAT TTTAAAATAA 3 84 0 

TTCTGCAAAT ATATTAACAC ATAATGTGTT CAAAAAGTTT TGAACAATTT CAAAACTTTT 3 90 0 

ATATAAAGGG nTTGACAACA TGGATTCAAA TnTCTTATTT T7AAAAATTAC CTCATATAGT 3 96 0 

GTCATGTTAG CCAATTTTTA AG 3 982 
(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521:

AGCTTGGaTG ATTTAATaTG GtCCtTTCCC AaCcTTAGAT AATGAAAGAT tTGATAATCC 6 0 

TGAATATAAA GAAGCTATGA AAAAAT AT c A ACAGAGATTT ATGGCTGmAG ATGAGGCTTT 120 

GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA ATTTAGATTA 180 

TCTAGGATTA TCTCATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 24 0 

GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC C CTG AATTG A AAGACTTTAA 3 00 
GTTAGGTAAA ACATTTTATC AAAACTATAG AGATGATGTT GAAAGTTTAT ATAGTAAGTT 420

AGATTTAATT ATGGGATATA AAGATGAAGA AAGAGCAAAT AAAAAAGCAG TTAACAAAAG 4 80 

GATGTTAGAA AATAAAAAAG AAGACTTAGA AACCATAATT GATGAATTTT TTAGTGATAT 540

AGATAAAACA AGACCTAATA ATATTCCTGT TTTAGAAGAT GAAAAACAAG AAGAGAAAAA 6 00 

TCATAAAAAT ATGGCTCAAT TAAAATCTGA CACTGAAGCA GCAAAAAGTG ATGAATCAAA 660 

AAGAA3CAAG AGAAGTAAAA GAAGTTTAAA TACTCAAAAT CACAAACCTG CATCTCAAGA 720 

AGTTTCTGAA CAACAAAAAG CTGAATATGA TAAAAGAGCA GAAGAAAGAA AAGCGAGATT 78 0 

TTTGGATAAT CAAAAAATTA AGAAAACACC TGTAGTGTCA TTAGAATATG ATTTTGAGCA 840

TAAACAACGT ATTGACAACG AAAACGACAA GAAACTTGTG GTTTCTGCAC CAACAAAGAA 900 

ACCAACATCA CCGACTACAT ATACTGAAAC AACGACACAG GTACCAATGC CTACAGTTGA 960 

GCGTCAAACT CAGCAACAAA TTATTTATAA TGCACCAAAA CAATTGGCTG GATTAAATGG 1020

TGAAAGTCAT GATTTCACAA CAACGCATCA ATCACCAACA ACTTCAAATC ACACGCATAA 10 80 

TAATGTTGTT GAATTTGAAG AAACGTCTGC TTTACCTGGT AGAAAATCAG GATCACTGGT 114 0 

TGGTATAAGT CAAATTGATT CTTCTCATCT AACTGAACGT GAGAAGCGTG TAATTAAGCG 1200

TGAACACGTT AGAGAAGCTC AAAAGTTAGT TGATAATTAT AAAGATACAC AT AG TT AT AA 12 60 

AGACCGAATA AATGCACAAC AAAAAGTAAA TACTTTAAGT GAAGGTCATC AAAAACGTTT 1320 

TAATAAACAA ATCAATAAAG TATATAATGG GCA 1353

(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1987 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 60 

ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 120

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180 

AGCTCAAAAA ACATTAGCTG AAGAAGTAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 24 0 

TGATGCAATC CGTATTTCAC AAGCATTATT TAGTGGTGAT TTAAAATCAT TATCAGCGAA 300

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 3 60 
TGTTAACAAT GGTGCGATTT ATATTAATGG TGAGAGACAA CAAGATGTTA ATTATGCTTT 480

AGCACCAGAA GATAAAATTG ATGGCGAATT TACGATTATT CGTCGCGGTA AGAAAAAATA 54 0 

CTTCATGGTT AACTATCAAT AAATATAATT GCATAGCTAA ATAAATTAGA GCCTACTCAT 600 

ATTCATTCCT AAGAATGTAA TGAGTAGGCT CTTAATGTAC TTTTCTGTCT GTAAATTATC 66 0 

TAAAGAAATC TCTATCGCCT TGGCCAGGTG ATTGACGTTC TGATCGGCTT TGACGTTTTG 72 0 

GTTGTTCTTT TTGTTGTTTT AATTTCACTT TAACTTCTTT TGTTTTACCA TCACGGATAA 78 0 

CGGTAACAGT GACTGATTCA CCAGGTTTTT TATTTTCATA TAAATAGCTT CTTAAATCAA 840

CATCATCTTT AATTTTCTTG CCATCAATTT CTGTAATAAT ATCACCTTTT TTAAGATCAA 900 

TATCACTATC AGCTTTGGCG ACATAAATAC CGTCTTCTCT ATCAGTATGA AGTTGCTCGC 96 0 

GCTCTTCTTC AG G AATATC T TTCAAATTAA TTAAACCAAT ACCAATCGAA GGGCGGTCAA 1020 

TTTTACCATG TTTTACAAGT TGTTCAATTG TTACTTTAAC TTCATTACTT GGAATAGCAA 108 0 

ACCCGATACC TTCAACTTGT GTCGCAGCAA TTTTCATTGA GTTAATACCA ACTAAATTAC 114 0 

CATT AATATC TACTAATGCG CCACCTGAGT TACCTGGGTT AATAGCAGCA TCTGTTTGAA 1200 

GAACGCTAAC TTTTGTATTG CCACCAGTTG TCTCAGCGTC AATCGTACGT TCGCTTGCTG 1260 

AAATGATACC AGATGTTACA GAGTTAGCAA ATTGTAATCC TAATGGGTTA CCCATTG CG A 132 0 

ATACGCTATC GCCAGTTTGT ACTTTTGAAG AGTTGGCAAA TTGAATCGCT TTAATACCTT 138 0 

TTGTATTTTC AATTTTAAGT ACAGCAATAT CAGTTACTGc ATCTTTACCA ACTAATTTCG 144 0 

CTTTAACTTG TTTTTTATTA TGTAATTGGa CTCTAATTTC ATTTGCGCCA TCAATAACAT 15 00 

GATTGTTTGT AACGATATAA GCTGAATTGT TGTTTATTTG aTAGATAACA CCTGAACCTA 1560 

CTCCAGCTTC AGATGGTTTA GATGATTTGC CyTTTAATAA GTCGTCTACA CTTGATGCTT 162 0 

TTtGCATGTk AATAACTCCA ACAATTGTAG GGGAGACAGA TTTTATCATT TCATGAACGG 16 80 

TACCGAATTT CTTGCTTTGA CCGTCTAATT GATTGCCACC TTTATTATTT GTTGTCTGAA 174 0 

CAGTTGAACC ATCTTTATTT AAAATTGTAC TATTTAATAC TTTGCCTATA CCAAGTACTA 180 0 

GAAGTGCACC AATAATTCCA GCAATCAATG CAACGATGAC TGTnTTAAAC CATGGAAATT 186 0 

TAGGTCTTCT GTATCTAGGT GTTTGGCTAT GGTTTGTTGT AGAATGATCT GTATGATTAA 1920 

AATCTGACAT ACTTAACCTC CATTATATGA TTTATATATG CTTTAATTAT GTCTTTTnTT 1980 

TATAATT 19 87 

(2) INFORMATION FOR SEQ ID NO: 523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 

TTAGAAGTCA AATCATTAnT GGCGTCnTAT CGAGTATACT ATTAACTTCA ACTATTTTAG 6 0 

CAATTGCATA TATTTTAATG TGGTTTAACG GCCATATGAC aCTAACTTTG ACCTTAACGA 12 0 

CAATAATTAC AAG CTGTTTA ACCTTATTAA TATGTAGTAT TTTTATTAAT CCACTTATAC 180 

AAAAAATTAA GCAGTTTAAT ATAAAAACTA AGCAATTTGC TAACGGAAAT TACGCAAGCA 24 0 

ATGATAAAAC GTTTAATTCA CCAAAAGAAA TTTATGAATT AAATCAATCT TTTAATAAAA 3 00 

TGGCTTCTGA AATTACGCAA CAAATGAATC AAATTAAATC CGAACAACAA GAAAAAACAG 36 0 

AACTGATTCA AAACTTAGCC CATGATTTAA AAACACCTTT AGCAAGCATT ATTTCATATT 42 0 

CTGAAGGACT ACGTGATGGT ATAATCACTA AGGATCATGA GATTAAAGAG TCATACGACA 4 80 

TATTAATTAA ACAAGCAAAC AGATTATCAA CATTATTTGA TGATATGACT CATATTATCA 54 0 

CTTTAAATAC AGGTAAAACA TATCCCCCAG AATTAATACA ACTAGACCAA TTACTTGTAT 6 00 

CAATATTGCA ACCATATGAG CAACGTATCA AACATGAAAA CCGCACATTA GAAGTGAATT 6 60 

TCTGTAACGA AATTGATGCA TTTTATCAAT ATCGAACGCC ACTTGAGCGT ATTTTAACAA 720 

ACTTACTTGA TAATGCGCTA AAATTTTCAA ATGTTGG T AG TCGCATTGAT ATTAATATTA 780 

GTGAAAACGA AGATCAAGAT ACTATCGACA TTGCTATTAG CGATGAAGGT ATTGGCATTA 84 0 

TACCAGAACT ACAAGAACGT ATATTCGAAC GTACATTCAG AGTAGAAAAC TCTCGTAATA 900 

CAAAAACGGG TGGTTCTGGA TTAGGCTTAT ATATAGCTAA TGAACTCGCG CAACAAAATA 960 

ACGCAAAAAT CAGTGTAAGC AGTGATATAG ATGTAGGAAC TACGATGACT GTAACATTAC 1020 

ACAAATTAGA CATTACGTCA TAATCCGATT TATTTATAAA ATAAAATGCA AAGACTAAAA 1080 

AGAAGCTCCC ATTAATGAGG GCTTCTTTTT TTGTTTATTT AGAATAAACT TTATGGGTAT 114 0 

CCTTCTCATC ATTTTCAAGA CTTGAAAGAT TTGTAGCTTG AATAATATAT TTAGGACGTG 1200 

CCTTAACTTC ATAATATATC CTGC CAATAT ATTCACCTAC AACACCAATT GAAATTAACT 1260 

GTATGCCGCC TAATAATAAA ATAGCTGCAA TCGTTGAAAA ATATCCCGGA ATATTAACAC 132 0 

CAGATATCAT AATATTGATG GAATAAATAG A 1351 
(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524:

TTGTTGTCAG TTTAACGCAA CGTGTTACCG ACTTTTCAAG TACACAAcAA TATATATGTC 6 0 

TATTCCATTT TTAGCCCcTG CCATTTTCAT CATTGGTGGT ATTATGTTGT TTATTTCAAC 120 

ATTTAATAGC TTAGATGAAA CTGCTGAAAA TAATAACAAA ATAAAGAAAC TAATGATTAA 180 

AGGACTTATC ATTATTAACA TTTCATTTAT CGTTATGATG GTTTTAACAC CATATTGGTA 240 

CTTGTATTTA ATCGTCTATC TTATTTTCTT GTTGTTTTTA TTGTGGCAAA AGGTTTATAA 300 

ATTTTAATAC CAAAACTATT AAACACTTCT GATATTCTTA GTTCAAAATA TCAGAAGTGT 360 

TTTATAGTGT TATCTAGTTC AGATAAATAT TTCCTTACTT AAAAAAACGC CCTCCTCTTA 42 0 

TTTTGACCCC nAT 433 

(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1845 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 

CCCCCACTAT GATATGCTTG GCCTATTGCG TCAATCCCCT TATATTTCGG CAATAGAGAT 6 0 

GGATGTATAT TCAATATTTT ACCTTCAAAT GAAGCTAATA AGTCTGGACC AATTAGACGC 12 0 

ATGTAGCCAG CTAGAATAAT CCACTCTACC TTATCTTCAT TTAATAATGT TACTAAATGT 180 

TGTTCATACG CTGCTTTTGA ATCAAATTGT TTTGGTTCAT TAATATAAAC AGGAAT AT CG 240 

TGCTTTTTTG CTCTATCTAT ACAAAACGCA TTTTGATGAT C C GT AT AT AG CGCCGTAACT 3 00 

TCAATATTTT CAAGTTTTCC TGATTCAACA TGCTCAACTA TATTTkCAAA GTTACTTCCT 360 

GAACCTGATG CAAAAATCGC AATTTTAACC ATTGTTATAC CCCCAACAAT TCAATTGCAG 42 0 

TTGACTCATT TTTCACAATA TGACCAATTT GATAGGCTTC CACATTTTGT TCTGCTAAAA 4 80 

TCTTCAAAGC GCGTGATACA TCTTTTTCAT CAACGATAAC CGTATAGCCA ATACCCATAT 54 0 

TAAAAATGTT ATACATTTCA TTTGTGTCTA TATTGCCTTG TTGTTGTAAC CAATCAAATA 600 

TTTTTGGTGT TGGAAATGAT GTAGTATCAA TTCTAGCAGC ATATCCGGCT GGCAATGCAC 660 

GTGGAATATT TTCATAAAAA CCTCCACCAG TAATATGATT CATTGCCTTA ATAGAAACTT 72 0 

CTTTTTTCAA AGCAAGTACA GGTTTGACAT ATAATTTAGT TGGTTCTAAA AAGACATCGA 780 
GCACTAAACT GTATCCATTT GAATGAATGC CACTTGACGC AAGCCCTATA ACAACTTGTC 900

CCTCTTTCAC TTCTGAACCA TCTACATAGT CATCCTTTTC AACTGCTCCA ACAGCAAATC 96 0 

CAGCTACATC ATATTCGCCT TCGTGATACA TTTCACCCAT TTCAGCAGTC TCTCCACCGA 102 0 

TAAGTGCAGT ATTCGTTTCA ACACATGCAT CACTAATACC TTTAACAATT TGTTCAATAA 1080 

CTTCAGGAAC AACTTTGTTT GTAGCAATGT AATCTAAAAA ATATAATGGT TCTGCACCTG 114 0 

TCGTTAAAAT ATCATTAACA CACATTGCGA CTGCATCGAT ACCTATCGAA TCATGTTTAC 1200 

CATAGTCGAT AGCTAGTTTT AATTTCGTAC CTACTCCGTC TGTTCCAGAA ACTAAAACTG 126 0 

GCGCTGtCAT ATTTAATTGT GATAAATCAA ATGTAGCACC GAATCCACCT AAACCACCGA 132 0 

TAACTTCTTT ACGCATCGTA CGtTTAACaw GaCtAGaCaT TCtTTCt ACA GCTTCATAAC 13 8 0 

CAGCATGAAT ATTTACACCA GATTGTTCAT ATGCTTTAGA CATTTAAATT TCCCTCGCTA 144 0 

TCAAAAAAGT GTTTGTTTTT AGAAATATAT TGCTTTTGTC GATGACTTAA ATGCGCTTTA 150 0 

TAATTTGCTT CATAATCATA TAACCCTGCA GGATAATCTC CAGTGAAACT TTCTACACAT 156 0 

AAGCCACTAT ATGGCGCGTC ATAATCTAAA CCAATTGATT CAATTAACCC AT CT A CAG AT 162 0 

AGATATGCTA ATGAATCAGC GCCAATATAA TCTTTAATTT CTTCAGGTGA TTTGCTTGCA 1680

GAAATTAATT CTGCCGTAGT TGAAACATCG ATTCCGTAAA AACTTGGAAA CATAAATTCC 174 0 

GGTGATGCTA TACGCACATG CACTTkATTT GCACCAGAAT CkTTTAACAT TTTCACAATG 18 00 

CGTCGAATTG TCGTACCGCG riACAATGGAA TCATCAACAA GACTG 1845
(2) INFORMATION FOR SEQ ID NO: 526: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 

TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT TTTGGTTATT 60 

TGAAGCTTGT GGCGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT GATCTGTAGC 120 

AGGTTGTTGT TGATTGCTGG AmtCACTGTT ATTAGTTGAm TCACTGTTCT CGTTAGATGT 180 

CTTATCTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT CTAAAGGCGT 24 0 

TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT CATTTTTATC 3 00 

ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT GAATGCTTTC 36 0 
AGTTTTAATT TCAACATCAT TTGTTaAGTA CATTTTTGCT AAAGCTTTAA TTTCAGAGTT 480

AGTTAAATTA TGCTTTGCAT TTTTACCTAC AATTTGAATC ACGTTATCAA GTTTATCAAT 54 0 

AGAATCAACT TCCTGTGCTT TTTGGAATAA AATCTTAATT AATTCCATTT GACGTTGTCC 6 00 

ACGTTTTAAG TCTGAATCAT GATGTCTAGT TCTAGCAACT GCTAAAGCCT CATCACCATT 660 

TAATTTTTGG TACCCTTTTT TAATTTTAAT CTTACCAGTA TCATCTGTGT TAGGTTCATT 7 20 

TAAGTCGTAT GGCACATCAT AGTATATGCC ACCAAGCTCA TTTAGAGCCT CGACAAATGC 780

TTTCATATTG ACTCTCACAT AATAATCAAC AGGTACATTC ATGGTAGCTT CTACCGAATC 840

CATTGCGGCA ATTGGACCAC CATATGCATG TGCATGGGTA ATCTTATCGT AATAGCCAAC 900

TTTAGGAATG TAGCTGATAG TATCACGTGG AATACTAAGC ATTCTAATTT GATGTTTTGA 960 

TTGATTAAAA GTAGTTAAAA TCATAGCGTC TGATCTAGAG TGTTCAGCAT CCTGTCCTTT 1020 

TTTTCTTCTT CCATCGTTAT CATCGATACC TAAGAAAAGA ATAGAGATAG GTTGTTCTTC 1080

GGGATGACTT TATTATCTCT TAAGTTGGAT TGACGTTAGC ATTTTTGCTG TCTTGAGAAG 114 0 

A 1141 

(2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527:

GCACCATTAG CATTGGGCnC ACTGATTGGA GTAnCAGTTG TTGAAAATTC GGCGCCAACA 60

AGTAAACmGG CmCaGGCaGC mATamCCCCa TATTATAcTT ATAATGGTTA TATTGGTAAT 120

AATGCTAATT TTATTCTGGA TAAGAATTTT ATTAACGCGA TTAAGTATGA TAATGTGAAA 180

TTCAATGGTA TTAAATTAGC TAAAACGAAT ACGATAAAAA AAGTAGAGAA ATATGATCAA 240

ACTTTTAAAG GTGTTTCTGC AAAAGGAAAC GAAGCAAGTC AATTGCAATT TGTAGTTAAA 300

AATAATATTT CATTAAAAGA TATCCAAAAA GCTTATGCCA AGGACTTGAA AAAAGAAAAT 360

GGTAAAACAA AGGAAGCTGA TAGCGGTATT TTTTACTATC AAAATGCTAA AAAGACATTA 420

GGCATCTGGT TTGTCGTTGA TCATAATAGA GTTGTCGAAG TAACAGTTGG ACATACACCA 480

TACAAAACAA GTAAATAAAA TAATGGCATA TTAAGGCTAG AGTGTGAGGA GTGATACCGC 540

ATTCTAGTCT TTTTTATTAA ATAATAACGA TTATTGCGTC TTACATAGTT GTTTGAAATT 600

GATTAAGTAT ATAGAGCACT ATTTTGTATT TGTTAATATT TTCACAAAAA TAAAGCCTTG 720

ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT TATTTAGCTA CAGATAACAT 780 

TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC AAAAAGTATA G AAAT AAA T A 84 0 

GAAATAATCA TTTACTTTTC TGAAAAATTA AATTAATATT TTATTTATAA GCTGTTTTTA 900 

AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT ATGCATTTAC AAGTATGGCA 960 

GCATTAACCT TGTTGAGCAC TTTATCACCA GCAGCATTAG CGATTGATTC AAAAAATAAA 1020 

CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA GTGATGCGGT CAAAGCATTA 1080 

AAAGAATTGC CTAAATCCGA AAATGTAAAA AATATTTATC AAGATTACGC TGTTACTGAT 114 0 

GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC AACCGAGTGT TGATGGTGTT 1200 

CATGCACCTG aCAAAGAAGT GAAAGTACAC GCAGACAAAT CAGGAAAAGT CGTTTTAATC 1260 

AATGGGGATA CTGATGCGAA GAAAGTAAAG CCAACGAATA AAGTGACATT AAGTAAAGAT 132 0 

GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA AGAATAAAGC GAAAAATCTT 1380

aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG GTGACAGTAA TAAATACGTT 144 0 

TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC ATTGGAAAGT TAAAATTGAT 150 0 

GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA AAGAAGCTGC AGAAACTGGT 1560 

AAAGG 1565 
(2) INFORMATION FOR SEQ ID NO: 528:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2870 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528:

GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT TTTAATATAG 6 0 

AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC TACTTTTCTA 120 

TCATATTTAT TTACTATCAT ATTGTGGGGA TCTGCATTTC CAATGATAAA GATTGCGTTA 180

AATGATTTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC AATAATTCTC 24 0 

TTGCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT CCCTGTTATT 3 00 

TTTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT TGGTGAAaCT 36 0 

TTGATTAGTG CAGGTATAtC TGGrATTCyA GTCTCTACAA CGCCTATTTT TTCTAGTGCT 420 
GCATTTATAG GAATATCCAT TATTTCAATA AGTAAAGATG ATTACACAAC TATTAATGTA 540

TTAGGTGTTT TTATTATTTT ACTTGCATCT TTTAGTGAAA GTTTGTATTT TACTTTCCAG 60 0 

AAAAAATACA TAGAAAAATA CGGCTTCATC GCTTTCACAC TATaTACAAT AATGG CAAGC 66 0 

TCACCATTTA TGCTTATTTT TATTCCTGAA ATCATGAACG ATATACACGG CGCCACTTTT 72 0 

ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTTG 7 80 

CTTGCTTATA TTGTGAAGTC AGTTGGTGTC TCTGATGCAA CAATGTCTCT TTATTTAACA 84 0 

CCTATCGTTT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCTTGCT 900 

ATTATAGGCG GAATTATCAC TCTACTAGGC GTTAGTTTAA GTAACTTCTT TCAAAATACA 960

TAATTATTCC AAGTCCCGCA CCTCAGAATC CAAAAACATT CGAGTGATAA AATTTTAAAA 102 0 

ATCAAAAATA TAAAAATGAT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 1080

CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 1140

ATTACAGTTA TGGAAATTAT G AAG CAT TTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 12 00 

ACAAATCCGC TTACTTAATC GGATCTGGTC TAG CTTCACT TGCTGCAGCT TGTTTTTTAA 12 6 0 

TAAGGGATGG TCAAATGGAA GGTTCGAAGA TTCATATTTT AGAAGAGTTA CCTAAAGCAG 1320 

GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 13 80 

AAATGGAGAA CCACTTTGAA TGTTTGTGGG ACTTATTCAG ATCTATCCCT T CATT AG AAA 14 4 0 

TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 1500 

CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTTTAGT CACAGACGGA GACTTCACTT 1560 

TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT AACGAATGAA GAAGATTTAG 1620

ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT TAATTCAAAC TTTTGGATTT 1680 

ACTGGAAAAC GATGTTTGCA TTTGAACCGT GGCATTCTGC AATGG AAATG CGTCGCTATC 174 0 

TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCTTTA AAATTCACTA 1800

AATATAATCA ATATGAATCT TTAGTATTAC CTATGGTTGA ATATTTAAAA TCGCATGGGG 1860 

TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATGTTACG ACAAGTCAAA 192 0 

AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTGACTA 1980 

TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATGGTGATA 2 04 0 

ATGATACACC AGCGCCACCA AC TG ACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 2100 

TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTGATAAGTT TTGCCAAAAT ATTCCTAAAA 216 0 

AAAGTTGGTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 22 2 0 
TCAATGATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC 2340

AACCTGAAAA TGAAATATCT ACATGGATTT ATGCCTTATA TTCAGATGTA AACGGCGATT 2400 

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 2460

ACTTAGGTGT ATCAACTGAC AAAATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 252 0 

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGAC CTT 2580 

TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC 264 0 

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 270 0 

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 276 0 

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 282 0 

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 287 0 
(2) INFORMATION FOR SEQ ID NO: 529:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3105 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529:

CnGTTTAGCT GCAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 6 0 

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 12 0 

AAGAAATTTT AATTTTACTA ACTTTATGGC AAAAAGG TTC TATGACGCTT AAAGAAATGG 180 

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 24 0 

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 300 

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA 360

GTAGAGCAAC AGCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 420 

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 4 80 

ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 540

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTGCAAAAC 60 0 

TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 660 

TCCTTCGCTA TGACCATGAT ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 72 0 

TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 78 0 
TATATGATGT TCAATATTTT CATGTCCATC TTTTATTAAA GAAACATGAT TGAAATCCTT 900

AACTTTATCA CGTGTATTTT CCAAAGCTAA ATCTTGAATG TCGAAACCAT AAACATGTCC 96 0 

TTCTGGTACT TGTTCGGCTA AAAATAAAGT GTCATTGCCG TTACCGCAAG TTGCGTCTAC 1020

AACAATACTT TCTGGTGTTA TATGTTGTTT AATAAGTGTT TTTGAAAAAG GGAGTATACG 10 80 

TTCTAATTTC ATTGCTTCAC CTTAGACTTG TAACGCAAGC CTTGATAAGA ATTTCTACGT 114 0 

GCTAATTCAG CATCGATGCC ATTTAATACT TCCCATTTAT TAACACTCCA CATTGGACCT 12 00 

ACCATGATAT CTATTGGACC ATCACCGGTA ATTCGGTGAA CGATCATTTC AGGGGGAATC 1260 

ACTTCTAATT GGTCCACAAC TAGGTTTGTG TACTCTTCTT GAGTCATAAA AGTTAATAAA 1320 

CCTTTATCGT ATTGTTTTAC CATCGGTGTA CCTTTTAACA AATGAAGTAA ATGAATTTTA 13 8 0 

ATACCTTGTA CATCCATTTG TGCAcTTCTT TGGCAGTAGC CATCATCATG TCATAGTCTT 1440

CGCCAGGTAA GCCATTAATG ATGTGTGTAC ATACATTGAT ATTATGCTTA CGTAATTTTG 1500

CCACACCATC ATAATAAGTT TTCATATCAT GGGCACGATT GATTAAATCA GATGTTGACT 1560

GATGGATTGT TTGTAGTCCT AATTCAACCC ATAAGTATGT TCGTTGATTC AAATCTGCTA 162 0 

AATATTCGAC AACATCGTCT GGTAGACAGT CAGGACGCGT ACCAATAGAT AATCCCACAA 1680

CACCCGGTTC TTTAAGTACA GGTTCGAATT TTTCTTTTAA TACTTCAACC GGTGCATGTG 174 0 

TATTTGTAAA TGCCTGAAAA TAAGCAATAT ATTTTCCTTC GTGCCATTTC TCATGCATCT 1800 

TTTCTTTAAT TTCTTTAAAT TGTACTGCGA TTGAATCTGC ACGATTACCT GCAAAGTCTC 1860 

CGCTACCTGC AGCAGAACAA AATGTACATC CACCATGTGC TACAGTGCCA TCGCGGTTAG 1920 

GACAGTCAAA CCCGCCATCC AATGCAACTT TAAATATTTT TTGTCCAAAT TTATTTTTTA 1980

AATGGTAATT CCATGTGTGA TAACGTTTGT TTTCAAAAGC GTATTGGAAA TGATTGCCCA 2 04 0 

TATGTCATTT TCCTTTCTAT AAAAAAAGAG TTCTAAGTAC AGATTTTAAC ATATTTTAAT 2100 

GTTATAGTGT TTATTATAGT TTGACAAAAA AGAGAGAGGA ACTATGAAAT ATGAATATAC 2160 

CTAAATCAGT CTGGTGGCTA GTAATTGGCA TGGCGTTAAA TATTACTGGT TCCAGTTTTT 22 2 0 

TGTGGCCTTT AAATACAATT TATATGAAAC AAGAACTTGG AAAAAGTTTA ACTGTTGCTG 2280 

GTTTAGTGCT AATGATAAAT TCATTTGGCA TGGTTATTGG AAACTTATTA GGTGGTTCAC 2340

TATTTGATAA ATTAGGTGGA TACAAGACGA TTTTAATTGG AACTTTCACT TGTCTTTGTA 24 00 

GTACAACGCT ACTTAATTTC TTTCACGGGT GGCCTTGGTA TGCTGTATGG CTTGTAATGT 24 6 0 

TAGGGTTTGG TGGCGGAATG ATTATTCCTG CGATATACGC TATGGCTGGA GCAGTGTGGC 252 0 

CAAATGGCGG AAGACAAACG TTTAATGCGA TATACTTAGC GCAAAATATT GGTGTGGCTG 25 8 0 
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10 



ATCTTATTAT GTATGTTGTG TTTGCGCTTG TCGCGGTAAC GCAATTTAAT ATTGAAATTA 2 7 00 

ATGCGAAAGT TAAATATCCA ACTCATTTAG ATATTACTGG TAAAAAGAAT AAAGCAAGAT 276 0 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 2 8 20 

GGGAGTCTAC AATCGCTTCA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2 8 80 

TTTTATGGAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACCGATTC 2 94 0 

TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 3000

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA ACTTTACAAT ATTTGyTGTC GGTATGATTA 3060 

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCCA ACTAT 3105
(2) INFORMATION FOR SEQ ID NO: 530:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5532 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 

TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 6 0 

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTACCAATG GAACTTGCTG 12 0 

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTTATAA TGATTTAATT TCATTTATTA 18 0 

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 24 0 

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG ATCAATTAGA GGTGATTTAG 300 

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAGAGTCT GCTGAACGTG 36 0 

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 420 

GGTTATATGA ATAAAATATA AACTGTAAAC CTTTACGATT TATTTATAAA GGTAGAAAGG 480

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTTGTA 54 0 

GTAAATTACT TGAAAAATTT TATAGTTTTT kGGTAAcACG TAtTaAAAaG AGAGGAATAT 600

TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA AAATGAAgAA TATTATTTCG 660

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 72 0 

TTAGATAAAC CACTTAATCC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 78 0 

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 84 0 

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 90 0 
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GATACGATTA TATTAAAACG GCTAATCATT TTTAATTAAT GATTATATGA TGCAACTGTT 102 0 

TAGAAATTCA TGATACTTTT CTACAGACGA ATATATTATA ATTAATTTTA GTTCGTTTAA 1080 

TATTAAGATA ATTCTGACAT TTAAAATGAG ATGTCATCCA TTTTCTTAAT TGAGCTTGAA 114 0 

AACAAACATT TATGAATGCA CAATGAATAT GATAAGATTA ACAACATATT ATAATGTTAT 1200 

CGTGGAAGTA TGAAAGGAGC GAGTGTGTAT GAGATACCTA ACATCAGGAG AATCACATGG 12 60 

ACCTCAATTA ACAGTTATTG TTGAAGGTGT ACCTGCAAAT ATAGAAATTA AGGTTGAGGA 1320 

TATTAATAAA GAAATGTTTA AGCGTCAAGG CGGTTACGGA CGTGGACGTC GTATGCAAAT 1380 

TGAGAAAGAT ACAGTAGAAA TAGTATCAGG CGTTAGAAAT GGTTATACAT TAGGTAGTCC 14 4 0 

AATTACTATG GTTGTAACCA ATGATGACTT TACGCATTGG AGAAAAATTA TGGGAGCAGC 1500 

TCCAATAAGT GAAGAAGAAC GTGAAAATAT GAAACGTACT ATTACAAAAC CAAGACCTGG 1560 

TCATGCAGAT TTGGTTGGAG GTATGAAATA TAATCATCGT GATTTACGAA ATGTGCTAGA 1620

GCGATCATCT GCTAGAGAAA CAGCAGCTCG AGTTGCAGTC GGTGCCTTAT GTAAAGTGTT 168 0 

ATTACAACAG TTAGATATCG ATATATACAG TCGTGTTGTT GAAATAGGTG GAATTAAAGA 174 0 

TAAAGATTTT TATGATTCAG AAACATTTAA AGCAAATCTT GATCGTAATG ATGTTCGTGT 1800

AATTGATGAC AGTATCGCAC AAGCAATGCG AGATAAAATT GACGAaGCTA AAAATGAAGG 186 0 

AGATTCAATT GGCGGTGTCG TTCAAGTTGT AGTTGAAAAT ATGCCTGTTG GTGTAGGTAG 192 0 

TTATGTGCAT TATGATCGTA AGTTAGATGG TAAGATTGCA CAAGGTGTTG TCAGCATAAA 198 0 

TGCTTTTAAA GGTGTAAGCT TTGGTGAAGG ATTTAAAGCA GCTGAAAAGC CAGGTAGTGA 204 0 

GATTCAAGAT GAAATTCTAT ATAATAGTGA AATTGGTTAT TATCGTGGAT CTAATCACTT 2100 

AGGTGGTTTA GAAGGCGGTA TGTCAAATGG AATGCCAATT ATCGTTAATG GTGTAATGAA 2160 

ACCAATTCCA ACGTTATATA AACCATTAAA TTCAGTAGAC ATTAATACTA AAGAAGACTT 222 0 

TAAAGCAACA ATTGAACGTT CTGATAGTTG TGCTGTTCCT GCAGCAAGTA TCGTCTGCGA 2280

ACATGTCGTA GCATTTGAAA TAGCAAAAGC ATTATTGGAA GAATTCCAAT CAAATCATAT 2340

TGAGCAACTT AAACAACAAA TTATTGAGCG CAGACAATTA AATATTGAGT TTTAACAACA 24 00 

AGAACAATTG AGGTGTAATC ATGAAATTAC AAACAACATA CCCTTCAAAT AATTATCCAA 2460

TATATGTTGA ACACGGTGCA ATTGACCATA TTAGCACGTA TATTGATCAG TTTGATCAAA 2520

GTTTTATATT AATTGACGAG CATGTAAATC AATATTTTGC TGATAAATTT GATGATATTT 2 5 80 

TATCATATGA AAATGTACAT AAAGTTATTA TTCCAGCTGG TGAAAAGACG AAAACATTTG 2 64 0 

AGCAATATCA AGAAACATTA GAGTATATTT TATCCCATCA TGTAACTCGT AATACAGCAA 2 7 00 
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ACGAGGCGTG CACTTTATAC AAGTGCCAAC GACTATACTA GCGCATGATT CTAGTGTTGG 2 820 

CGGTAAAGTG GGTATTAACT CAAAGCAAGG TAAAAACCTT ATCGGTGCAT TTTATCGTCC 2 88 0 

AACTGCTGTG ATTTATGATT TAGTCTTTTT AAAGACGTTA CCATTTGAGC AAATATTAAG 2940

TGGCTATGCA GAAGTTTATA AGCATGCGTT ATTGAATGGT GAATCAGCGA CGCAAGATAT 3 000 

CGAACAGCAC TTTAAAGATA GAGAGATATT ACAGTCATTA AATGGTATGG ATAAATATAT 3 060 

TGCTAAAGGT ATTGAAACGA AGCTGGATAT TGTTATTGCA GATGAAAAAG AACAAGGTGT 3120 

ACGTAAATTT TTAAATTTAG GTCATACATT TGGTCATGCT GTTGAATACT ATCATAAAAT 3180

ACCTCATGGT CATGCAGTGA TGGTTGGCAT TATCTATCAA TTTATAGTTG CGAATGCTTT 3 24 0 

GTTTGATTCT AAGCATGATA TTAATCATTA TATTCAATAT TTAATAGAAC TCGGCTATCC 3 3 00 

TTTAGACATG ATAACTGACT TGGATTTTGA AACGTTATAC CAATATATGC TAAGTGATAA 3 3 60 

AAAGAATGAT AAGCAAGGTG TACAAATGGT CTTGATTAGA CAATTTGGAG ATATCGTTGT 3420

ACAACATGTT GATCAACTAA CATTACAACA TGCATGTGAA CAATTAAAAA CATATTTTAA 3 4 80 

GTAGGTGAAT GAAATGGTAA ATGAACAAAT CATTGATATT TCAGGTCCGT TAAAGGGCGA 3 54 0 

AATAGAAGTG CCGGGCGATA AGTCAATGAC ACACCGTGCA ATCATGTTGG CGTCGCTAGC 3600

TGAAGGTGTA TCTACTATAT ATAAGCCACT ACTTGGCGAA GATTGTCGTC GTACGATGGA 3 66 0 

CATTTTCCGA CTGTTAGGTG TAGAAATCAA AGAAGATGAT GAAAAATTAG TTGTGACTTC 3 72 0 

CCCAGGATAT CAATCTTTTA ACACGCCACA TCAAGTATTG TATACAGGTA ATTCTGGTAC 3780 

GACAACACGA TTATTGGCAG GTTTGTTAAG TGGTTTAGGT ATTGAAAGTG TTTTGTCTGG 3840

CGATGTTTCA ATTGGTAAAA GGCCAATGGA TCGTGTCTTG AGACCATTGA AACTTATGGA 3900 

TGCGAATATT GAAGGTATTG AAGATAATTA TACACCATTA ATTATTAAGC CATCTGTCAT 3 96 0 

AAAAGGTATA AATTATCAAA TGGAAGTTGC AAGTGCACAA GTAAAAAGTG CCATTTTATT 4020 

TGCAAGTTTG TTTTCTAAGG AACCGACCAT CATTAAAGAA TTAGATGTAA GTCGAAATCA 4080

TACTGAGACG ATGTTCAAAC ATTTTAATAT TCCAATTGAA GCAGAAGGGT TATCAATTAA 414 0 

TACAACCCCT GAAGCAATTC GATACATTAA ACCTGCAGAT TTTCATGTTC CTGGCGATAT 4 2 00 

TTCATCTGCA GCGTTCTTTA TTGTTGCAGC ACTTATCACA CCAGGAAGTG ATGTAACAAT 4260

TCATAATGTT GGAATCAATC CAACACGTTC AGGTATTATT GATATTGTTG AAAAAATGGG 4 320 

CGGTAATATC CAACTTTTCA ATCAAACAAC TGGTGCTGAA CCTACTGCTT CTATTCGTAT 43 8 0 

TCAATACACA CCAATGCTTC AACCAATAAC AATCGAAGGA GAATTAGTTC CAAAAGCAAT 44 4 0 

TGATGAACTG CCTGTAATAG CATTACTTTG TACACAAGCA GTTGGCACGA GTACAATTAA 4 50 0 
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AAACTTGTTA GGGTTTGAAT TACAACCAAC TAATGATGGA TTGATTATTC ATCCGTCAGA 4 620 

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TGATGCTTGC 4680 

AGTTGCTTCT CTACTTTCAA GCGAGCCTGT CAAAATCAAA CAATTTGATG CTGTAAATGT 4740

ATCATTTCCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT 4 800 

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTACAA AAACTAGAAA ATTTAGACTC 4 860 

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC 4 92 0 

ACTTTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4 980 

ATATCCAGAC GAAAGTGAAT TGCTGATTTA TTTTATTGAA GGTTTAATGT CTGAAAATCA 504 0 

AACTGACGAA GCGTTAGAAT ATTTAAGTTA TGTTGAACCA TCACCTGAAA AGTTGATGTT 5100 

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 5160 

AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATTGGCTG AAATGTTATA 5220

TTATGATGGT CAATATTTAC GTGCTACCTC TGAATACGAA ACCGTTTTAG AAACTGGTGA 52 80 

ATATCAAGTT AATGGTGTAA ACTTATTCTC TCGTATGGCA GATTGTAGTT TACAAAGTGG 534 0 

kAACTATAGT GATkCcGATt CgCTTATACG ATGrAATTAA TGAAGATGAA ATGACTTCAG 5400

AAGATTATCT CAAAAGAGCC ATTTCTnACG ATAAAAATGA CATCACTCAA GAAGCAATTA 5460

AAATAATGAC TACATTACTT TCTAAAGATC CTGATTATAT TCAAgGCTAC TTGTATTTAC 5 52 0 

aATCaTTATA TG 5532 

(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 942 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATGGATAAAA AAGAATTAGC 60 

GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 120

ACTTGGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 180 

AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 24 0 

TTTGTTTCCT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 300 

ACCTCCCCGC CAAGAAAACG TACTTAACTA CGCAAATGAG CAATTAGATG AACAGAATAA 360 
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AACTGGTGCT GG CAT AGGAG AAGAATTATA TGATGACATA TTGCATGAAG AAGTATTTTT 4 80 

TAAAGAAGAC GAAACGCCAT CAAATGCTGA TTTTTGTATT TTAGTTAATG GTGATTCAAT 54 0 

GGAACCTATG TTAAAACAAG GAACATACGC TTTTATTAAG AAAGAAGATT CTATTAAAGA 600

TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG 660 

CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT GATGATATCA AAGTCGCTTC 720 

GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 78 0 

CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 84 0 

TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 900 

ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC 942
(2) INFORMATION FOR SEQ ID NO: 532:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 

TGGnAAATGC AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 6 0 

GGGTAAAGAA TGGTTATTTT AAACCGTCAG AAGATAAATC AAAAGAAACA TATACAATTG 12 0 

TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 18 0 

TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 24 0 

GTATGGATCA TGCTGGTATT GCGACACAGG CAAAGGTAGA AGCTAAATTA AATGAACAAG 3 00 

GAATAACTAG ATATGATCTT GGTCGTGAAA AGTTTTTAGA ACAGGCATGG GATTGGAAAG 360 

AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 417 
(2) INFORMATION FOR SEQ ID NO: 533: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533:
GATCCTGAAC CCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 6 0 
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w 



ATTTTCATAC CATCCACCTC CATAATCATC TTAACGCGAA CATTTTGAAA GCGCAATCAA 180 

AAATCCACAA AATTGTAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 240 

ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 300

TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTGGTGTTG CTAAACGTAA 36 0 

CGATGCGAAG TTAACAATTG TGAATGTAAT TGATTCAAGA ACGTATTCTT CTTATGAAGT 420 

TTATGATGCT CAATTTACTG AAAAATCTAA GCATTTTGCA GAAGAATTAT TAAATGGTTA 4 80 

TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 54 0 

TCyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATTATGAG 6 00 

TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTGTAT CAGAATCTAT 660 

CGTTCGTCAT GCGCCATGTG ACGTGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 72 0 

CCAACCACAA GTT 733
(2) INFORMATION FOR SEQ ID NO: 534: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6060 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 

TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 60 

CAATAATATA AATCATGTTA TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 120 

TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATGATATTA CAGTTACAAA 180 

TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA AAAAACTAAC 240 

TGATCGTGCT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 300

TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 36 0 

AAATAAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATGA 4 20 

TAGCCCtAtT GATTAAGTAC CACGGtCAAt ATATCACTAT ACACGACATT GCTCAGCAAC 480

TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 54 0 

CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 6 00 

CTGATTTAAA CGATTTGAAG CAATCGATTG CACAACATCA AACCATTGAC TTATCTGTTG 66 0 

AAGAGCAGAA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 720
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TAGAGCTTGA TTTAAATAAG TACCAACTAT CTTTATCTCG AAAGCGTGGC GAAGGCATTT 84 0 

ACTTGGTAGG TACTGAATCA AAGAAACGTG AATTTTTAAG TCAATTAATG GTGAATAACT 900 

TAAATAGTAC TAGCGTTTAT TCAGTAATTG AAAATCATTT TGTCTTTCAT TCATTAAATC 960 

AAATCCACAA AGACTTTGTT GACTTAGAGC GCATTTTTAA TGTTGAAAGA CtATTaATGG 1020 

ACTACCTAAG TGCCTTACCC TACCAACTTA CCGAATCAAG TTATTTAACT TTAACTGTCC 1080 

ATATCGTGCT CTCCATTTCA CGTATAAAAA ATGGAGAGtA TGTCGCATTA AACGATGATA 114 0 

TTTATGATTC TGTACAAAAC ACATTTGAAC ACAAAGTaGc AAGCGAACTT GCTGATAAAC 1200

TTGGTCAAAT ATATGACGTC ACGTTTAATC AGGCAGAAAT TGCTTTCATT ACTATCCATT 1260 

TACGTGGAGC TAAACGAAAA AATCTTAATG ATACATCATT AAATAATCGT TGTGAAGAAA 13 20 

ACAAAATTAA AGCGTTTGTT AACAAAGTAG AAATGATTTC CGGTATGACA TTTGCAGATT 13 80 

TGGATACTTT AGTAGATGGA CTGACGCTAC ACCTTAATCC TGCAATCAAT CGTTTGCAAG 14 4 0 

CTAATATCGA GACCTATAAT CCGTTAACAG ACATGATTAA GTTCAAATAT CCAAGACTAT 1500 

TTGAAAATGT AAGATTAGCT TTAAATGATT GTTGGCCTGA TTTGATTTTT CCAGAGAGTG 1560 

AAATTGCTTT TATAGTtTTA CACTTTGGTG GCTCGATTAA AAACCAAGGT AATCGATTTT 1620 

TAAACATATT AGTCGTTTGC AGCAGTGGTA TGGGAACTAG TCGTCTATTA TCAACTCGTC 1680 

TAGAGCAAGT TTTTAGTGAG ATTGAGCGTA TTACACAAGC ATCAGTCAGC GATTTGAAGT 1740 

CACTAGATTT AAGTCAATAT GATGGCATTA TTTCTACTGT GAATTTAGAC ATCGACTCCC 1800

CCTATTTAAC GGTAAACCCA TTATTACCAG ATAGTGATAT CAGTTATGTC GCACAGTTTT 1860

TAAATACAAA GTCTACGTTC CAAGAGACGC ATGATAAATC ATCAAACATG ATTGATAAGG 1920

ATGATGTTCA TGTTGAAACG AAAGATGTTG ATGGCAACAC ATCTTTTGAA AATGAACAAA 1980 

CTTCATACTT AACTTCAGTT TTCGAAAAAC ATTTAAGTGA CGAAAAATCA GAACAATTAT 2040

TGCATCATAT GCGTTCGGGT TTAACTTTGC TTGATTCAGT GAAAATAGTT AGTACCGAAG 2100 

TTAAACAGTG GCAAACATAT ATCGCAGATT ATCTATATCA ATGCGATGTA ATAAACGATC 216 0 

CAACGTCATT CGCTGAACTA CTAGAGCAAC GATTGATTGA CAATCCAGGA TGGATATTAA 2220 

GTCCATATCC TGTTGCAATA CCACACCTAA GAGACAATAT GATTAAACAC CCTATGATTC 2280

TAATCACAGT TTTAGAAGAA CCGTTAACAT TGCCTAGTAT TCAAAATGAC AATCAAACAA 2 34 0 

TTAAATATAT GATTTCCATG TTTATTTCTG ACAATGATTT TATGGCATCA CTGGTAAGTG 24 00 

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG AATCTATTGA TACTTTTATG GAAAATCCAC 24 60 

AGGAACTTGA AACATTATTA AGAAACAAAT TTTTAGAACG AATTAAAAAA CAATTTATTT 2520



EP 0 786 519 A2 



TAACAGCCAA AATGAAGCAA TTGAAAAAGC AGGTAAAGCC TTAGTTGATA GTGGTGCTGT 2640

AACAGATGCT TATATTCAAG CAATGAAAGA TCGTGAGCAA GTCGTATCAA CATTTATGGG 2700

AAATGGCTTA GCAATTCCTC ATGGCACAGA TGAAGCTAAA ACAAATGTGA TTCACTCAGG 2760

TTTAACATTA TTACAAATCC CTGAAGGCGT TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 2820

CGTGGGAATT GCTGGTAAAG ATGGCGAACA TTTAGACTTG TTATCTAAAA TTGCAATTAC 2880

ATTTAGCGAA GAAGAAAATG TGGATCGTAT CGTTCAAGCA AAATCTGCAG AAGAAATTAA 2940

ACAAGTATTC GAGGAGGCAG ATGCATAATG AAAGCAGTTC ACTTTGGTGC TGGTAACATA 3000

GGTCGTGGTT TCATTGGTTA TATTCTgCAG ACAACAATGT TAAAGTAACA TTTGCAGACG 3060

TCAATGAAGA AATCATTAAT GCTTTAGCTC ATGATCATCA ATACGATGTT ATTTTAGCTG 3120

ATGAGTCTAA AACAACGACG CGCGTGAAtA ATGtTGATGC AATTAATTCA ATGCAACCTT 3180

CTGAAGCGTT GAAACAAGCA ATTCTAGAAG CTGATATTAT TACAACAGCT GTTGGTGTTA 3240

ACATACTACC TATTATTGCT AAATCTTTTG CGCCTTTCTT AAAAGAAAAA ACAAACCATG 3300

TTAATATTGT TGCTTGTGAG AATGCTATTA TGGCAACTGA TACATTGAAA AAAGCAGTAC 3360

TTGATATTAC TGGCCCTCTT GGTAACnaTA TTCATTTTGC TAACTCAGCA GTTGATAGAA 3420

TTGTACCATT ACAAAAGAAT GAAAATATAT TAGACGTTAT GGTTGAGCCA TTTTACGAAT 3480

GGGTTGTTGA AAAAGATGCA TGGTATGGTC CAGAACTAAA CCATATTAAA TATGTTGATG 3540

ATTTAACACC ATATATTGAG CGTAAATTAT TAACTGTGAA TACAGGACAT GCATATTTAG 3600

CGTATgCTGG tAAATTTGCA GGTAAAGCTA CAGTTTTAGA TGCAGTTGAA GATAGTTCAA 3660

TTGAAGCTGG CTTACGCCGT GTTTTAGCTG AAACTAGTCA ATATATTACT AATGAATTTG 3720

ATTTTACTGA AGCGGAACAA GCTGGTTATG TTGAAAAAAT AATAGATCGT TTCAACAATT 3780

CTTATTTATC TGATGAAGTA ACACGTGTCG GACGAGGTAC ATTACGTAAA ATTGGCCCTA 3840

AAGATAGAAT TATAAAACCA TTAACATATC TTTATAATAA AGATTTAGAA CGCACTGGTT 3900

TATTAAATAC AGCTGCATTG TTATTGAAGT ATGATGATAC AGCAGACCAA GAAACTGTTG 3960

AGAAAAATAA TTACATTAAA GAACACGGTT TAAAAGCGTT TTTAAGTGAA TATGCTAAAG 4020

TTGACGATGG CTTAGCCGAT GAAATAA7TG AAGCGTACAA TTCACTTTCA TAATTTATTG 4080

AGCTTTGTTT GAAACAAGAA GTTTCCAACG TTATTCGTTA ACAATCAGTA ATAATGTAGT 4140

AGTTCCCTTG AATTAACAAT ATTAAATTTC TGAACATAAA AAATACTCCC TTCAACATAG 4200

ACACTTAACT TGTGTTATGT ATGAAAGGAG TATTTTTGCG TTAATAATTT GTTTTATTTT 4260

CGAGCCACAG CCACCTATTC AATGGCTATT GGTCATTACT AAAACAAATT CATATTAACT 4320
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TTGAATAAAT TTTATTCTTC AGTTTGTTGG TCTTTCTTAG TGAATCTTCT AATTAAGAAT 4 44 0 

GCCATACCTG CACCTAGAGC TAATTCAGCA TATGGTAAAT CGTCATTATG TGACATACCA 4 500 

GTATCTGGTA AAGTTTTAGC TTGTTGTTTA GCTTTATTAA CTTTTCCTTG TTGAGCTGAT 4 56 0 

TTTGTCTTAG CTTGGTGGTC GTCAGTGTTA GTTACATTAA GCATATCTTG ATTAGCACTA 4 62 0 

TTGCTTCCAT TTGAAACTGT AGCTGGAGAT GCATTGGCAC CGTCGTTTTG CGTAgyTTTA 4 68 0 

TTGTTTGCAG CTGAACCAAC TGATTTTTGC GTATCATTAG TATCTGCTGT TGCCGTATCA 4 74 0 

TCTTTTTGGC TAACATTAGT TGAAGTCATT TTTTCTTTTG CTTCAGAAGA TGCAGATGTT 4 800 

GATGGTTTAT TCGAAACTTC AGTATCAGCT TTGCTTGGCG ATTTATCTGC TTCGTTAGAT 4 86 0 

GCAACGTTAG TTTCAGACTT AAGTTGTCCT GCATCAGTTT GATTTGTCGT ACTTTCTTCT 4 92 0 

TTATCTTTTG ATGTATTAGA AGGTACATTT GGTTCTGTTA TGTCTGCTGA AGGCAATGTT 4 98 0 

TCAGTTGTtG ATTCAACCAT ACTTTGATTT GTTGAATCAC TACCATCTTT TTCTGCCTTA 504 0 

GCTTTATTTT CAGATTTTGG TTGTGCAACC TTGTCATTAG TTGATTGAGA TTCAGCACTA 5100 

TTATTTACTT CAGCATTTTG TTTTGAATCA TTTACAGATG CATTATCTTT GCTATCAGCA 516 0 

GATGATGCTG CTTCTGTGCT CGCAGTTGTT GGAGCCGTTG CTGTTGATCC TGTTGGTGCA 522 0 

TTCTCGTTTG TTGCTGTAGT TGTACTATTG TTATTTGTTG TGCTTTCTGC TGGCGTTGCA 5280

TTATCAGTTT CTGTTACAGG TTTATCAGTT GTGCCGTTAT TAGTTGATTC TACTTCTGGT 5340

TTACTAGTTA CATCGTTATC CATTGTCGGA CTGTTTGTTG ATGCATCTAC ACTAGAATTG 54 00 

TTATTAGCTT GCGGTTTATC ATTTGCATCA TCAGTTGCTG ATGTTGCTGT TGTTTCACCT 54 60 

GTTGCCGCAT CACTATTATT TGGTGTTGTC GGAGAAGCGT CTGCTTTGCC ATTAGCTGTC 5520 

GTCTCAGATA CGTTAGGTTG TCCAGTATTT TCTGGTGTTG CATTAGCATT TGAATTTGCT 5 580 

GTTGCATCAT TATTATCTAT ACCATTATTA GTATCATTAG CATCTGGATC ATTCTGAGGC 5640

ACAATCGCTT CAATTGCAGG TATCGTTACA TTTTGTAATT CAGCAACTTC TGCATTTGTT 5700 

TGTGTTTTAT CTAATTTATC AGCAAATCTG TCAAAATATC TACCTAAATC CGTACGTGCA 5760 

ATTTCTTTCG CCGATGCATC TGCATCTGCA TTTTTAATTA TTTCTATTTG CTTGTTAACC 5820 

ACTTCTCTGA TTGCTTCCAA AGCATTTTTC TTAACTTCAG GATTAATACG TTGTGCTTTA 5880 

AGTTGTTCAA GCGCACTATT TTTGACAGTA GCGATTTCTG CATTTGTAGT TTGATCAGAA 5 94 0 

ATATCTTCAG TTGCTTTTGA TAAAATGTCT TCTAAAGCAT TCGTAAACGC TTCTTTTTCT 6000 

TCAGTTGTAG CATCAGCGTT GACATTTACA CCTGCTTCAA TCTGGTCTAG TGCAGTTTCT 6060 
(2) INFORMATION FOR SEQ ID NO: 535: 
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(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535:

AACAAAGCCT TCCAATTATC TGCGTCGGTA GAACAAGTAT TAGCAACTTT ATCACCTACG 60

CTAAACAGTC CTTACGATTT ATACGGCACG ACAAAAATGC TAGATATTAC ATTCGATTCA 120 

TTTGAACATG ATGGTACAAC GTACCCTGTC GACTATGCTA CGTTTGAAAA TGATTATGAA 180 

GATAATAAAG ATCCTGAGTT TAGACGTAAA AGTTTCAAAT CGTTTAGCGA TGGGATTCGA 24 0 

AAATATCAGC ATACTACCGC GGCTACATAT AATATGCAAG TACAACAAGA AAAAATTGAA 300 

GCTGATTTAC GTGGATTTGA ATCAGTCATC GATTATTTAT TACATAGTCA AGAAGTAACG 360 

CGTGATATGT TTGACCGTCA AATCGATATG ATTATGCGTG ACTTGGCACC AGTTATGCAG 42 0 

AAATATGCTA AACTTTTACA ACGTATTCAC GGATTAGATA ACATGCGTTT TGAAGACTTG 4 80 

AAGATTTCTG TAGACCCTGA TTATGAACCA GAGATTTCAA TTGAAGACTC AAAAAATTAT 540

ATTTTCGGTG CGTTAAGTGT TTTAGGTGAT GACTATACAA ACATGTTACG TGAAGCATAC 600 

GATCAGCGAT GGaTTGATTT TGCACAAAAT AAAGGTAAAG ATACAGGCGC ATTTTGTGCA 66 0 

AGTCCATACT TTACACATTC ATATGTGTTT ATTTCTTGGA CTGGTAAAAT GGCTGAAGCA 720

TTTGTCTTAG CACATGAATT AGGTCATGCA GGTCATTTTA CATTAGCTCA AAAACATCAA 78 0 

CCATATCTTG AATCAGAAGC ATCAATGTAC TTTGTTGAAG CCCCTTCTAC AATGAATGAA 84 0 

ATGTTGATGG CCAATTATTT ATTTAACACA AGTGATAATC CAAGATTTAA GCGTtGGGTT 90 0 

ATTGGCTCAA TTTTATCTAG AACATATTAT CATAATATGG tACCCmTTTA TTAGAAGCnG 960 

CTTATCCACG GGGAGTG 977 
(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536:

AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAATTGGT AACATTCTTC 60 
TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATTCATCT AACGCTGCAC CTCGTAATTT 120 

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AATCTATTAA AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 24 0 

TTATGCTAAT CCAGCGCGTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 300

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 360

TAATTCACGG AACGGTGTTT TCGTTTCCCA AGATATCATC GCTTTTGGTT GTACTTTGTC 4 20 

GTATGCTTCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 480

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 540

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 60 0 

ATAATACGCT CAGCAGAAGA ATGAGAAATA TsTCTTTCAT GCCATAATGG CACATTCTCA 660

TAAGCTGTAG TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 72 0 

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCTTTTT GGCCTTTTGC AAATGCTTCT 7 80 

TCAACTTCTC TCGTTTCGGT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 840

GATGTCGCGA TTAATGCTAA TGTCGCAATA TAGTATGCAT GTCGATCGCG TTGCAATGTT 900 

TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 960 

GGAGGAATGT TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 1020

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCATTTTG 10 80 

ACACCAAATG TAGTTGGTTC TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 114 0 

TTATAATTTT TTGCTTTTTC AGCTAAAACG TCGATAAATC TTTCTAAATC TTTTTCAATA 1200

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 12 60 

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTTGT CTAGTAAAGG 132 0 

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 13 80 

CGTTTTGACG AATTTTTTGT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 14 4 0 
(2) INFORMATION FOR SEQ ID NO: 537:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537:

GATAAATCTA TnCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG TGTTCTACAA 6 0 

TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTAAAT TGTAGTTACA CCCTCAGCAA 120 

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CATGTAATGT TGATGGTTTT ATCGTTGCTG TGCCTTCGTC AACTTCAATA TTTGCACCCA 24 0 

TGCGCTTTAA TTCTTCAACA TGTTTAAAAC GCTCCGGATA AATCGTATCA GTTACAAATG 3 00 

AAGGACCATT TGCCATAAAT AATAATGGTG TAATAGGCTG TTGCAAATCA GTAGCAAAAC 360 

CTGGATATAC TAGTGTTTTA ATATCAACAA ATTGATATGG CGCATTATTA TTGATGCGAA 4 20 

TTCTTTCGTC TCTTACATCA ACATTCACAC CTAATTCACT AAATTTAGCA GTTAATGTTT 4 80 

CTACATGTTT CGGAACAATA TTATTTAATA TAACATTTTC TCCACATGCT GCAGCGATAC 54 0 

ACATATATGT GCCTGCTTCA ATTCTATCAG GTATAACTTG ATACTCAGAA CCATGTAATT 60 0 

CTTTGACGCC ATTGATTTTA ATTGTTGATG TACCCGCTCC CTTAATATTA GCTCCCATAC 660

TTGTTAAGAA GTTAGCAACA TCAACTACTT CCGGTTCTTT AGCAGCATTT TCAATTACAG 720 

TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGtnAAT TGTTGCACCT ACGCnAACCA 780 

TATC 784
(2) INFORMATION FOR SEQ ID NO: 538:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538:

CAATCTCCAC CAATGACACG TCGTCATTCG TGACCTCATA CCArACAAAA AACAGTCTCG 6 0 

CAATCAAGAC TGTTTTCCAC TCAATATATT CATCCATTAG CGTAATAGAT TATTTGACTT 12 0 

CTGTAGCTAC AAAGaTTTTA CGTTTTTCCC AAACGCCTGT cTTTTCATTG TAATCATCAC 180 

AAGTAATTAA TGTTAATTGT TTATCTTTAC CTTTTTGTTC ATCTAGAAcT CCTACATCTG 24 0 

TAGGCTTAAC ATCTCTTATA CTTGTCATTT TATACTTACG TGTTTCATTA CCAACTTTAA 300

AGTACACCAT ACTACCTTTT TTGGCTGCTT TAAGATTTGT AAATTGATAG TTCGGACGGT 360 

CAATGAAAGT GTGTCCTGCA ATTGAAATAT TTTGATCATC TAGTGATTCA TTTTCTTCTG 420 

CAAAGCTTAC ACCTCTATTT AATTGTTCAG GTGTTGCTGG TCCTGGATAT ACTGGTTCTT 480

TAATATCAGC ATCTGGAATT TCAATATAGC CTGCCACTTT CGATTTATCT TTCGGAATTT 54 0 

GAGGTTTAGC TTGCTGCTTT TTATCTTTAC TCGCCTGTTC TTTTACATTT TTATCATATT 600 

GTTCAATCTT TTCATCTTTA TCTTTATCGT GAAGATAATT ATCGATATGT GGTTTAGCAA 660 

ACAAATATGC TGCCACTAGG aTAAGTACCA CACCAGCGAT TGTCATTAAT CGATTTGTCC 720 

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AGTATACCAT TAATTTCAAA ATGACTCATA GCAATTCATT TTATACTATA AAATTTACAT 84 0 

GTATACTTTT ACGTTAGATT TCATTACACA TATTTGCATT CAAATAACGA AACGCTTTTA 90 0 

ATAATTACTA AGGGGGAATT GATATGATTA GATACGCTAA AAAAGAGGAT TTAAACGCTA 96 0 

TATTAGCGAT ATACAATGAT GCCATTATCA ATACTACAGC TGTTTATACT TATGAACCAC 1020 

AAACCATAGA CGAACGTGTC GCATGGTTTG AAACGAAACA ACGTAAGCAT GAGCCTATCT 10 80 

TTGTATTTGA GGAAAATGGA AGTGTCTTAG GGTTTGCGAC GTTCGGTTCA TTTAGACCTT 114 0 

GGCCAGCATA CCTATATACA ATCGAACATT CTATTTATGT CGATGCTTCA GCTAGAGGAA 12 0 0 

AAGGTATTGC TAGTCAATTA CTACACCATT TAATTGTGGA AGCAAAAGCT AAAGGTTATC 12 6 0 

GTGCGCTAGT TGCAGGCATT GATGCTTCCA ACAAAGCGAG TATTCAGTTG CATCAAAAGT 13 20 

TTGCTTTTAA GCATGCCGGC ACACTGACCA ATGTAGGTTT TAAATTTAAT AGATGGTTAG 1380

ATTTAGCA7T TTACGAATTA GATTTACAAG ACTAGTAATG TTTGAATCAC ATAATATAAA 144 0 

CAAGACAACC ATGTTAATTC CCTTAACATA ACAAGCCAAC ATATAAAATT TTAAACTTCT 1500

CAGGGGAGTG GGACAGAAAT GATAAAGAGC CACTAATGAT TTATTATGTA GTGGTTCTTA 156 0 

CACATTAGCC ACAGCTAATG TGTAcTTAAA AATAGGAATA CATGAGTAAA ACTCATGCAT 162 0 

AAGAAATACT AATTTCTATA GAAAAAGTAT TTCTTTATCG TCGTCCCACC CCAACTCGCA 1680

CATTATTGTA AGCTGACTTT TCGTCAgCTT cTGTGTTGGG GCCCAAAAAG CTTGTTACAA 174 0 

GCGCATTTTC GTTCAGTCAA CTACTGCCAA TATAACTTTG TAGAGCATAT TACATTGATT 180 0 

TACATTGTCC CTTTTATTTA TTCTTTTCAA ATACTATCCC CATAGCTTTG ATTTAACGCT 186 0 

TTTTCTCAAT AACAAAACGA ATATAGTAGA ACATGAAAAC GATAATCATG CTGAGCGATA 192 0 

AAGATTTAAA TAATAGATTG ACCCACGTTC CCTCAGTCGT ATATCCATAT GTAATCGTTG 1980 

TGTTAATGAT GAATGCTATA AAGATGATTG ATAGTCTTAG CATATCATCA CTCCTTTTAA 204 0 

GTTATTTTAG ATATACGGGG GCGCTTTTGC AATCACTATT TTGATTAGTA TGCATTTTCC 2100 

ATAAATCTTT CAACTTCTTC AGAGATAATT AAGAAGCATC TATCTGGTAC TAATGATCCA 2160 

GACAGATGCT TCTTTTTTAT CAATATTTTA TTGTTATCTC ATTAATTATT TTTAACCATA 2220 

TCTTCAGCTG TGCCAAAGAT TTTACGTTTA ATTGCTTCGC CAGTTGGTGT GCCTGCTAGT 2280

CCACCCAATC CAGTTTCACG TAATGATGCA GGAAGGTTAC GACCAACCTT ATCCATTGCT 234 0 

TCAATAACTT CATCAACAGG GATTCTACTT TCAATACCTG CTAATGCTAA ATCTGCTGAA 24 0 0 

ATTAAAGCGT TACCCGAACC AATTGCATTT CTCATAACAC AAGGAATTTC AACAAGTCCG 2460

GCTACTGGAT CACAAACTAA ACCTAATAAA TTACTTATCG CTAATGCCAT AGCGTGCCCG 252 0 
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GAACCAACTT CAGnTTGGCA GCCACCTGTT GCACCAGCTA CACTTGCATT GTTTGCTACG 2 64 0 

ACACGCCCAA ACAATGCTGA AGTGAATAAG AAATCAATCA TTTGCTCTTC TGTTAAATCA 270 0 

TGTGTTTTTT CTAATTTAAA AAGTGCACCG GGAATGGTAC CCGAGGAACC AGCTGTTGGC 276 0 

GTtGCACAAA TAATACCCAT CGCAGCATTG ACTTCATTTG TTGCAATGGC AcCtTTGcTG 2 820 

CGTCAATCAT TTCATATCCA GACAAAGCAT GATGTGTTTC ATTATAATCA CGTAGTTTAG 2 880 

CAGCATCATG ACCAGTGTAG CCCGTTACAC TTTCAACCCC ATCACCTGTC GTCCCTTTGA 2 94 0 

TTACTGCGTC TCGCATGACA TCTAAATTTT GTTTCATTTG CGCTCGCACT TCATCACGTG 3 000 

ATTTACCGCT TAATTCCATT TCTTCTTTAA CCATGATATC CGCAAATGAC ATATTATTTT 3 060 

CTACGGCATA ATCTATAGTC TCTCTAATTG AATCAAACAT GTTTATTCCC CCTCTAATTT 3120

ATATAGGAAA CGTTTACGTC ACTGTATTTC TCTTTAATTG TATTTAATGT TGATTCTGAG 3180 

ATTGCTTTAT TTAATGGTAT TACAACCAAG CATTTATCTT CATCTATCTT AATAAATTCA 3240

TCTTTACAGT CTAATTTCAT ATCGTTGATA TCATTGATGA AATGATTTAC TTGTGCTTTA 3300

GTCATATTTC CGTCAACAAC TAAAATTGGT AATCCATGAT TTAAATCTAC TTCTAGTCCA 3 360 

TTTATATGAA TACCTTTAAT TTTAATTGTA CCACCACCGA TTGAAATACC GATAATTTCA 3420

ATGTAGCGAC CATCATTACG AGATGATTTG ATATAAGCAC AGTTTGGATG TTGACCAATA 3 4 80 

CTATCGCCTT CTTCTTCGAT GATATCTATT TTAATACCAT CATCAGCTGC AATTTCTAAT 3540

GAAGATTTAA TTCGGTTATC AAATGTTGAA TATCCCATTG CTCCACCCAC AATAGCGACA 3 600 

TCTGTACCAT GTCCTTGGTG TGTTTGAGCA AATGATTCAT AATAATGTAT TTCAATATTT 3660

TTATATCTCC CAATATTGCG CGTGCTGAAT TCCCCTTTAC TGCACCAGCC GTATGAGAAC 3 72 0 

TTGAAGGGCC CAT 3733
(2) INFORMATION FOR SEQ ID NO: 539:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 

TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 6 0 

GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 120 

TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 180 



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AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 3 00 

CAATAAATCG TGGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 36 0 

CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAaGAGGAA AaTGTAaGaT 420 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 4 80 

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CGGTA 525



(2) INFORMATION FOR SEQ ID NO: 540:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1408 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540:

TTGATTTGCT ACAAAGTATC TnCTCATTTC TGTATCCTGA AAAATCTTTA GTGTAATAAT 6 0 

GTTGTTCAGT TTTAATATTT TCAGTCATAG TTGACTACCT CCGTATATTT TGATTTAATT 120 

AAGTTGTATA TTTTGATGAA CACTTATTGT TACTTGTTGG CGCAAGTAGC AGTTTTTTCA 180

TTCTTCATAA AAGTATTCCT TATAGAATAT GAATGTTGCG ATACTTGCGA ATCCTGCAAT 24 0 

TGaCCATGCT GtAGTGAAGT ATAGAAACGG CATAAGTACA ATCGCTAAGA CTGTGAAGCA 3 00 

TAGTACTGCT ACTAGGTAGC TTTTATAAAT GTTACTCATT TTCTTTTTTC AACTCCTCCA 3 60 

TTATTCTCTG GTCTGATAAG TCGTGATAAG GGAATTTTTT CCtAGCTAAT TGGACTGGTA 420 

TTCTGCCTCG TATCGCAATG TATCCTTCAT CTTCAAGCTC TTTATTCAGT TCTCTTATTA 4 80 

TTTGTCCTGC TTTGGATTTA GAAACAGATA AAATTACCgC AAGTTCTTTA GCTTGCAAAC 54 0 

TATTTTTCAT CATATCTTTT CCTCCTTTAA AATAACTGTT GATTCTCTGG GTTATCTGCT 6 00 

TCGTAATTAT CTGCAATAAT ACTTTTAGCG AAAAAGTCCA AACTGACCTT ATATAGGTTG 66 0 

TTCATAGATT TCTTTACGTT AACCCCTTCC TCAAGTACAT AAGGCACCCT AAAATCATTT 720 

ATAAACAGTC CGTTTTCGTC TAAAGTAACG GTTGGTAATT CAGGTTTGTT CCGTCTATAA 78 0 

ACTTCTCCTA GTGTAGGTTT TTGCTTTTCA GCTTGTTTAG TGAAGTCGGA AAATGCCTTA 840

AGTAGTTTTA TTCCTGAATC AGGATCACTG TGTCGCTCAA TCGTTTCTGC TGTAGACTCT 900 

TTACTAAAAT CATTTCTATT GATTACAGGC TTTCTCGTAT TTCGTTCAAT CTTCCAAACC 96 0 

TTCCACGTCA CAACTGCCAT TGTGGTGAGG AGGGTTGTTT TGTATAGTGC GTTCATTTGT 1020

AATTCCTCCT ATTAAGTTGT TTGTTCAATT GTGTGTGTTA TTCTTCTTCG TCTAAATCAA 108 0 

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CGACTTCGTA AGTGTGCTCA ATCTCGCCTG cATATGTCAC AGTAAGAGTA TCTTTGTGTG 1200

TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA TTGTTTAGCT 1260

TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AGCyTCATAA CCGAATTCAG 1320

TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 1380

CTTCTCTGCC ATCAGAACGC ACAATTGT 1408



(2) INFORMATION FOR SEQ ID NO: 541:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541:

GTTCGTCAAT GATTTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT GCTCTCTTAA 6 0 

AAGCAGTTAT TGACTGAAAA TCTACTTCTA AGAGAGCACT TTATTTAATT ACTTAAGAAA 12 0 

TCTTGAAATT TCAATAT A CG ATGTTTATGA TAAGTCGCTT ATTTCATCTT TAGGCTTGTT 180 

ATTAGTAAGT AGTTTAATAC CACTGATTAA C CAT AAAG C A AATGTAATTA TGTTACCACT 24 0 

TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG ATGCTTTAAA 300 

CAT AC CG ATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG ATAAGAATAA 36 0 

TACAGTGAAA ATTGCTGCTG CTGTTTCTGc ATCAACTGGG nCAACCTCAC CATTAACTGT 42 0 

TGTTGGACAC AT 432

(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2426 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

ATAATCATGA AGTnGCTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 60 

AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACArG 12 0 

GAAGCGATAA AATTAATCAT GCTATGATTG AAAAATTAGC TAAAAGTAAT GCCTCAACGC 180 

AACATTACAC AT AT CAT AAA CTGAATACGT TACAATCTTT AG AT C AACGT ATTGCAAATA 24 0 
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GTATAAAAAG TCAACGAAAT ATTATTTTGG AAGAACTTGC ACGTACTGAT GATAAAAAGT 3 60 

ATGCTACACA AAGCATTTTA GAAAGTATAT TTAATAAAGA CGAGGCAGTT AAAATTCTAA 420 


AAGATATACG TGTTGATGGT AAAACAGATC AACAAATTGC AG AT CAAATT ACTCGTCATA 4 80 

TTGATCAATT ATCTCTGACA ACGAGTGATG ATTTATTAAC GTCATTGATT GATCAATCAC 54 0 

AAGATAAGTC GCTATTGATT TCTCAAATTT TACAAACGAA ATTAGGAAAA GCTGAAGCAG 6 00 


ATAAATTGGC TAAAGATTGG ACGAATAAAG GATTATCAAA TCGCCAAATC GTTGACCAAT 660 

TGAAGAAACA TTTTGCATCA ACTGGCGACA CGTCTTCAGA TGATATATTA AAAGCAATTT 720 

TGAATAATGC CAAAGATAAA AAACAAGCAA TTGAAACGAT TTTAGCAACA CGTATAGAAA 780

GACAAAAGGC AAAATTACTG GCAGATTTAA TTACTAAAAT AGrAACAGAT CAAAATAAAA 840 

TTTTTAATTT AGTTAAATCG GCATTGAATG GTAAAGCGGA TGATTTATTG AATTTACAAA 900 

AGAGACTCAA TCAAACGAAA AAAGATATAG ATTATATTTT ATCACCAATA GTAAATCGTC 960

CAAGTTTACT AGATCGATTG AATAAAAATG GGAAAACGAC AGATTTAAAT AAGTT AG CAA 102 0 

ATTTAATGAA TCAAGGATCA GATTTATTAG ACAGTATTCC AGATATACCC ACACCAAAGC 1080 

CAGAAAaCGt TAACACTTGG TAAAGGTAAT GGATTGTTAA GTGGATTATT AAATGCTGAT 1140

GGTAATGTAT CTTTGCCTAA AGCGGGGGAA ACGATAAAAG AACATTGGTT GCCGATATCT 1200 

GTAATTGTTG GTGCAATGGG TGTACTAATG ATTTGGTTAT CACGACGCAA TAAGTTGAAA 126 0 


AAT AAAG CAT AATTATATTG GGGGAAGAGC ATCTATATAT TTTTTTAAGT ATATAAGACG 1320 

TCTTATTTCC CCTTAATTTA TTGTGAAGTA TATGCAAAAT GCAATGAATA GATTGTCCAT 1380 

CATTTTAACG TTATAATGAA TTTAACGACT TAGAACTACA CAAGTAAAGG AGAATGAAGA 144 0 


TGTCTCGAAA AACGGCGCTA TTAGTTTTGG ATATGCAAGA AGGTATAGCG AGTAGTGTAC 1500 

CTAGAATAAA AAATATTATT AAAGCGAATC AGAGAGCAAT TGAAGCAGCA AGACAACATC 1560 

GAATACCAGT CATTTTCATA CGTTTAGTGT TAGATAAGCA TTTTAATGAT GTCTCCTCGA 1620

GTAATAAAGT GTTTTCAACA ATTAAAGCTC AAGGATATGC GATTACTGAA GCAGATGCAT 1680 

CTACACGAAT ACTTGAAGAT TTAGCACCAC TAGAAGATGA GCCGATTATT TCTAAGCGAC 174 0 

GCTTTAGCGC ATTTACAGGT AGTTACTTGG AAGTTTATTT ACGTGCAAAT GATATTAATC 1800

ATTTAGTATT AACGGGTGTC TCTACAAGTG GAGCTGTATT GAGCACGGCA TTAGAAAGTG 1860 

TAGATAAAGA CTATTATATT ACTGTTTTAG AAGATGCTGT TGGTGATAGA TCAGATGATA 1920 


AACATGACTT TATTATTGAA CAAATTTTAT CACGCTCATG TGACATTGAA TCCGTAGAGT 1980 

CATGGAAAAG TAGTTTATAG TTAATATAAC GTCAATTAAA GCTCGGCAGT AATGTTTGAG 204 0 
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GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 2160 

ATGtCCTCGG GTTTAATTGA ATAGAAATCG GACTTATGAA CGAAATATGT TTAAGTCGAA 222 0 

CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 2280

ATGCATAATT nCTTCAGTCG GTCAGCCTGT CGTTGCATAG TTCCTATGCA GCAAATGCAT 234 0 

ATCCTAATCC TTTAACATTG GCATTnCTGC AAATGAACGC ATAGAATCCA TTCACTGTTA 24 00 


ACTTTTTnCA ACAAATGTCT nACATG 2426 

(2) INFORMATION FOR SEQ ID NO: 543:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1874 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543:

GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 60 

GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 120

GAAATAATCA TATTGGAT G A ACCGACTGCT GGTCAAGATT TCTAT CATT A TAATGAGATA 180 

ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 24 0 


ATGCATTTAT TGTCTGAGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 3 00 

GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA TCTGTGAGAT TGCATCATTG 36 0 

AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 42 0 


GTACAATTAT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 480 

TAGGTTTTCA CCCGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTGTT AAACTTTTAT 540 

TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 600 


TTAGTGCTTC ATCTATTTTA TTGGTCAAAT ATGCTCATAT TGAATGGAAA CAAGTTCGCT 660 

TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 720 

ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 780

GATTTACGCT AA CATC AC AG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 84 0 

GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 900 

GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 960

ATATTCCAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 102 0 


TACCTTTAAT ATTTTCTAGT AT CGAAAGAA TTGACACTAT TAGTACTGCT ATGGAGTTAA 114 0 

GACAATTCGG ACAGTATAAA AGGAGAACCT GGTACGTCAA AAAACAATTA AAAAAAGATG 1200 

ATTATGTTGT TTTGTGTTTG ACGTTAATAC TTCTGATGTT AGTAGTTACA TTATTCTTTT 1260

TAAATAATAG TCGATATTTC AACCCGTGGC ATTAGTATTC ATATAAATAG TCTTTAAATA 1320 

GAAATAGGAG GGAGACATTT AATGATAAAT ACTGAAAGAT TAAATTTAAT GATTCCAAGT 13 80 


TCCTCGCATT TAATTGAACT TTATAATATT TGTAGTCATC CACAAGCAAA TATATACACT 14 4 0 

CCCAAAGGTT TACATAATTC CAAATTAGAC ACACAACGGT GGATTGAAAA ATGGCGAAAC 1500 

CATTGGCAAC AATATCAATT TGGTTACTTT GTATTGGTAA AAAAAATAGA TTGTAGTGTT 1560

ATTGGTATTT GTGGATATGA ATATCGACAA TTAAAGCAAG AaACAGTATT AAATTTATTT 162 0 

TATAAATTAC ATCCAAGTTT TGAAGGACAA GGGTACGCAT GTGAGGCTAT TACAGCAATC 168 0 

ACAAATTTTG TGAATTATAT CGATCAAGAA ACAGTAAAAG TTATCAGGAC AAATAAGTGT 1740

AACCAACGTT CAATAAATTT AG CAGAAAGG CTTAAATTCA AGCGAGACGA TACTATGGAC 1800 

GACATTATCA ATCAAGGAGA TATTGTGTTT TAAnAATAAA ATACTATGAC ATT AT CT AAA 1860 


AAATAAAATT AAAA 1874 

(2) INFORMATION FOR SEQ ID NO: 544:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5280 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544:

TCAACATTTC TAACACCAAT GTGAAAATGA TCTATGTGAT TTGCAATGGC TTGATTTGTA 6 0 


ATATGTGTGC CTAAATGACC TGTAGCACCT GTTAACATAA TATTCATTCA CTTCATCTCC 120 
TAATCTTTAT ATACATAACA TAATACTTAT TTGATGGTTT TCAAAACATT TGATTTTATA 180 
AAAAATTCTA ATCTGTATTT ATTGTCGACG TGTATAGTAA ATACGTAAAT ATTATTAATG 24 0 


TTGAAAATGC CGTAATGACG CGTTTTAGTT GATGTGTATC ACTAATATCA TTGAAAATTT 3 00 

TAATCaGGTA CTACGACAAT ATGATGTCTG TTTTGTGTCT GAAAGTTTTA CAGTTTTTAA 3 60 

AATAAAAATG GTATAAAGTG TGATTTGTAT AAAAAAGAGT CTCGACGGAT AAGAATTGAT 420

TAATAACAGT TAGCATTTTA TTAATTACCT TAACAATGAT TCAAGTTTAG TTAAATGAGG 4 80 

TTTAATTTGA AAGGGGATAG CGCCTCAATA TAATGTAGGT AGATTGTTCA TATTACGTAA 54 0 
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AAGCAAGTTT AGCATTGGGA ATGTTAGCAA CAGGTGTAAT TACaTCGAAT G T ACAATCAG 6 60 

TACAAGCGAA aGCAGrAGTT AAaCAACAAa GTGAaTCAGA GTTAAAACAC TATTATAATA 720 

AmCCAATTTT AGAGCGTAAA AATGTGACTG GATTTAAATA TACTGATGAG GGTAAACACT 780 

ATTTAGAAGT CACAGTAGGG CAACAGCATT CTCGAATCAC TTTACTTGGA TCTGATAAAG 84 0 

ATAAATTTAA AGACGGAGAA AACTCAAATA TAGATGTGTT TATCCTTAGA GAAGGTGACA 900 

GTAGACAAGC AACAAATTAC TCAATTGGTG GCGTTACAAA ATCAAATAGT GTGCAGTATA 96 0 

TTGATTATAT CAATACGCCA ATTTTAGAAA TCAAGAAAGA TAATGAAGAT GTACTTAAAG 102 0 

ATTTTTACTA CATTTCAAAA GAAGACATCT CATTAAAAGA ACTTGATTAT AGATTAAGAG 10 8 0 


AACGTGCGAT TAAACAACAC GGCTTGTATT CAAATGGTCT TAAACAAGGT CAAATTACAA 114 0 

TTACAATGAA TGATGGCACA ACACATACAA TCGATTTAAG TCAAAAACTT GAAAAAGAAC 12 00 

GTATGGGTGA GTCAATCGAC GGCACTAAGA TTAATAAAAT TCTAGTAGAA ATGAAATAAT 1260

ACTTTCTAAC AAC AAAGCG C TATGTTGAAT AGTGCTTGTT ATGGAAATAT ATGGAAGTTA 132 0 

AGCGACGTAC TGTTGCTTAG CTTCTTTTTT TGAGGGGAAA AGTTACAAAA CTCACACAAA 13 8 0 

CAGTCGCACC ACGCATTATC TTTTGCTTAA ATAGCTTAAT CATATTTTAT GAATAGTTAA 1440

AAACAGGTTA ATGTGAATAT CCGAATACAG CTCCTATAAT ATGGGTGTAT GATTCAAATT 150 0 

ACGTAATAAA ACAATCTAAT TATAATAGAT TGGAGCATAC AACTATGAAA ATGAAAAATA 156 0 


TTGCAAAAAT AAGTTTGTTA TTAGGAATAT TAGCAACAGG TGTAAACACT A C AACGG AAA 162 0 

AACGAGTTCA TGCCGAAAAG AAACCTATTG TAATAAGTGa AAATAGCAAA AAATTAAAAG 16 80 

CTTATTATAA TCAACCTAGT ATTGAATATA AAAATGTGAC AGGTTATATC AGTTTCATTC 1740

AACCAAGTAT TAAATTTATG AATATCATAG ATGGTAATTC TGTTAATAAT ATTGCTTTAA 1800 

TTGGCAAAGA TAAGCAACAT TATCATACGG GTGTACATCG TAATCTTAAT ATATTTTACG 1860 

TTAATGAGGA TAAGAGATTT GAAGGTGCAA AGTACTCTAT TGGGGGTATC ACGAGTGCAA 1920

ACGrTaAAGC TGTCGACCTA ATAGCAGAAG CAAGAGTTAT TAAAGAAGAT CATACTGGTG 1980 

AATATGATTA TGACTTTTTC CCATTTAAAA TAGATAAAGA AGCGATGTCA TTGAAAGAGA 204 0 

TTGATTTTAA ATTAAGAAAA TAC CTTATTG ATAATTATGG TCTTTACGGT GAAATGAGTA 2100 

CAGGAAAAAT TACAGTCAAA AAGAAATACT ATGGAAAGTA TACATTTGAA TTGGATAAAA 216 0 

AGTTACAAGA AGACCGTATG TCCGATGTTA TCAATGTCAC AGATATTGAT AGAATTGAAA 2220 

TCAAAGTTAT AAAAGCATAA CACATATACT TGATGACGAA ATAAGTTGAA ATTGAAATAG 22 80 

AG AGG TTAAG TGACGATCAA ACGTTGCTTA ACTTCTTTTT AATGCTTAAA AATTATTTCA 2 34 0 



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TTAATAATAC TTCAATAATT GTTAAAAGGG GTTTAATGTG ATTATCTTAG AACGCCATCT 2460

ATAATGATGT TGTATGATTC AAATTACGTA AAAAGACAAT CGAATATAAT ATAGATTGGA 2520

GCATACAATT ATGAAAATGA GAACAATTGC TAAAACCAGT TTAGCACTAG GGCTTTTAAC 2580

AACAGGCGCA ATTACAGTAA CGACGCAATC GGTCAAAGCA GAAAAAATAC AATCAACTAA 2640

AGTTGACAAA GTACCAACGC TTAAAGCAGA GCGaTTAGCA ATGATAAACA TAACAGCAGG 2700

TGCAAATTCA GCGACAACAC AAGCAGCTAA CACAAGACAA GAACGCACGC CTAAACTCGA 2760

AAAGGCACCA AATACTAATG AGGAAAAAAC CTCAGCTTCC AAAATAGAAA AAATATCACA 2820

ACCTAAACAA GAAGAGCAGA AAACGCTTAA TATATCAGCA ACGCCAGCGC CTAAACAAGA 2880

ACAATCACAA ACGACAACCG AATCCACAAC GCCGAAAACT AAAGTGACAA CACCTCCATC 2940

AACAAACACG CCACAACCAA TGCAATCTAC TAAATCAGAC ACACCACAAT CTCCAACCAT 3000

AAAACAAGCA CAAACAGATA TGACTCCTAA ATATGAAGAT TTAAGAGCGT ATTATACAAA 3060

ACCGAGTTTT GAATTTGAAA AGCAGTTTGG ATTTATGCTC AAACCATGGA CGACGGTTAG 3120

GTTTATGAAT GTTATTCCAA ATAGGTTCAT CTATAAAATA GCTTTAGTTG GAAAAGATGA 3180

GAAAAAATAT AAAGATGGAC CTTACGATAA TATCGATGTA TTTATCGTTT TAGAAGACAA 3240

TAAATATCAA TTGAAAAAAT ATTCTGTCGG TGGCATCACG AAGACTAATA GTAAAAAAGT 3300

TAATCACAAA GTAGAATTAA GCATTACTAA AAAAGATAAT CAAGGTATGA TTTCACGCGA 3360

TGTTTCAGAA TACATGATTA CTAAGGAAGA GATTTCCTTG AAAGAGCTTG ATTTTAAATT 3420

GAGAAAACAA CTTATTGAAA AACATAATCT TTACGGTAAC ATGGGTTCAG GAACAATCGT 3480

TATTAAAATG AAAAACGGTG GGAAATATAC GTTTGAATTA CACAAAAAAC TGCAAGAGCA 3540

TCGTATGGCA GACGTCATAG ATGGCACTAA TATTGATAAC ATTGAAGTGA ATATAAAATA 3600

ATCATGACAT TCTCTAAATA GAAGCTGTCA TCGGAAAAAC AAGAAGTTAA GTGACAACGG 3660

TTTACATGTT GCTTAGCTTC TTTTATTATG CGTAATGATG TAAAAAGACG AATATTCATT 3720

TGTTTGTAAA AGTGGCATTT CTATGTCTTA AAAGTGACGA AACTTCAAAT GTGCCAAGTG 3780

TTGAATCACA TCAAAATCAT TTTTATTTAA CGAACATTAT GGATTTCTTA ATTTACTTAA 3840

CGATGATTCA AATATAGTTA AACAAGGTTT AATGTGAATG GAGCAATACG CCATCTATAA 3900

TAAAGCTGTA TGATTCAATG AATGTAATCG AACAAATCTA ATAATTACGA ATGGAGCATA 3960

CAACTATGAA AATAACAACG ATTGCTAAAA CAAGTTTAGC ACTAGGCCTT TTAACAACAG 4020

GTGTAATCAC AACGACAACG CAAGCAGCAA ACGCGACAAC ACTATCTTCC ACTAAAGTGG 4080

AAGCACCACA ATCAACACCG CCCTCAACTA AAATAGAAGC ACCGCAATCA AAACCAAACG 4140
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15 



CGCCTTCAAC TAAAGTGACA ACACCTCCAT CAACAAACAC GCCACAACCA ATGCAATCTA 426 0 

CTAAATCAGA CACACCACAA TCGCCAACCA CAAAACAAGT ACCAACAGAA ATAAATCCTA 4 320 

AATTTAAAGA TTTAAGAGCG TATTATACGA AACCAAGTTT AGAATTTAAA AATGAGATTG 4 3 80 

GTATTATTTT AAAAAAATGG ACGACAATAA GATTTATGAA TGTTGTCCCA GATT ATTT CA 44 4 0 

TATATAAAAT TGCTTTAGTT GGTAAAGATG ATAAAAAATA TGGTGAAGGA GTACATAGGA 4 500 

ATGTCGATGT ATTTGTCGTT TTAGAAGAAA ATAATTACAA TCTGGAAAAA TATTCTGTCG 4 56 0 

GTGGTATCAC AAAGAGTAAT AGTAAAAAAG TTGATCACAA AGCAGGAGTA AGAATTACTA 4 62 0 

AGGAAGATAA TAAAGGTACA ATCTCTCATG ATGTTTCAGA ATTCAAGATT ACTAAAGAAC 4 680 

AGATTTCCTT GAAAGAACTT GATTTTAAAT TGAGAAAACA ACTTATTGAA AAAAATAATC 4 74 0 

TGTACGGTAA CGTTGGTTCA GGTAAAATTG TTATTAAAAT GAAAAACGGT GGAAAGTACA 4 80 0 

CGTTTGAATT GCACAAAAAA TTACAAGAAA ATCGCATGGC AGATGTCATA GATGGCACTA 4860

ATATTGATAA CATTGAAGTG AATATAAAAT AATCATGACA TTCTCTAAAT AGAAGCTGTC 4 920 

ATCGGAAAAA CAAGAAGTTA AGTGACAACG GCCTACATGT TGCTTAGCTT CTTTTGTTAT 4 980 

GTTCGATGAT TTGAGAACCC GAATTTTCGA TGGGTCCAAA TATGACGTGG AAGAGACCTG 5040

AATTTATCTG TAAATCCCTA TCTATCGGGT GTGAAGCACA ACGGGATCAG TTTTATTTAA 5100 

CGAACATTAT AGATTCCTTA ATTTACTTAA TAATGATTCA ATGATTATTA AACATGGTTT 5160 


AATG TGAAAG GTCAAATACG CTAACTATAA TAAAGCTGTA TGATTCAATA GACGTAAGCG 522 0 

AACAAATCTA ATAATTACGA ATGGAGCATA CAACTATGAA AATGACAGCA ATTGCGAAAG 52 8 0 

(2) INFORMATION FOR SEQ ID NO: 545:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 886 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545:

AGTAAAATTG CCGGTATGAT GGACACAAAC GGTGACCTTG GTCAAGGTGA ATTAGCGATT 6 0 

AATCCACCtA AATCAGATTT gAACGAATTA CCTTGGGCTA CACGTAAAAA TAAACAGCCA 120 

GCTTCATCCG AAAAAgGTTC AAGTGGTCAT CAT GGT AATG CAGCAATGCC TCAAACCAmA 180 

TTAGATTATC AAATATCTAT TGATAAGGTC GTTGAACAGG CGCAAAAAGC TGGTATTAAA 24 0 

AAGCCGTTTT CAAT CGT AT A TCCAAGTGAT AAAAATGGTA CCTTTATTGT ATCTAATACT 3 00 
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GATCAATATA GCGGTAAAAA GCTAGGTACG ATTAAATATG ATGACTACGG TATTATTGCT 4 20 

AAATGGTTTA CATGGGGCAT TCCGCTTCAC GAAGGTCATT TATTCGGCAT TTTAAATAAA 4 80 

ATCATTAATT TATTTGTATG TATCGCTTTA TTAGTAGCCA TTGGCATGGG GTTTGTCTCT 540

TGGATAAAGC GTACAAAAAA TACTGCAGTA AAAGTACCAC ATCGCGTAAA AAAACCAGCA 6 00 

TCTATATCAC TCATAATATG TTTAATTGTA TTAGGATTAT TAATGCCATT ATTTGGATTA 660 


TCACTTATCC TTG TATTT AT AATTGAATTA ATATTATATA TTAAAGATCG TCGTGCTAAA 720 

CAATAATGCA CTTAAAGTTT TGAACTGACG AAATTTACAA AATGgATTCT CGTCTCTCTA 7 80 

ATTACtTAAA ACGGGgTtCy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 84 0 

CTGtCCGAtT TTTTnGAAwT TGAAGATAAA AAAGCATCTA AAACGC 886 

(2) INFORMATION FOR SEQ ID NO: 546:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4336 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546:

GGCATTTGTG TCCTTATATA AGGAACTGTG t TAAATACAT TACTGTTGTT AAGTTGTTTT 60 


TGTAATTCAA AGAGCAGAAC AGAGTAACAT CATCAGTTGT AGTAAACGAT AATCCGGTAA 120 
AACAACTAAA TGAAATAATG AAAGTCATTT AACCTGAACA TTAAAATATA TTTGTTTTTC 180 

ATTAAGAATA ATTCAAGTAT ATTTAAATCG AGGTTAATTA TCGTATGAAA CGATGCACGT 240

TATAATAAAA ATGTATGATT CAAATTACGT AATGAAAACA ATCCAATATA TTAAGATTGG 3 00 

AGCAAATAAA TATGAAATTT ACAGCATTAG CAAAAGCGAC ATTAGCTTTA GGAATTTTAA 360 

CAACAGGAAC TTTAACAACA GAAGTTCATT CAGGTCATGC AAAACAAAAT CAAAAGTCAG 420

TAAATAAACA TGACAAGGAA GCATTATACC GATACTACAC TGGAAAGACT ATGGAAATGA 480 
AAAATATTAG TGCTTTGAAA CATGGTAAAA ACAACTTACG TTTTAAGTTT AGAGGTATTA 54 0 


AGATTCAAGT TTTACTGCCT GGAAATGATA AAAGTAAATT TCAACAGCGT AGTTATGAGG 600 
GGTTAGATGT TTTCTTTGTT CAAGAAAAAA GAGATAAGCA CGATATATTT TATACTGTTG 660 
GTGGTGTAAT ACAGAATAAT AAAACATCTG GAGTTGTCAG TGCACCAATA TTAAATATTT 720
CAAAAGAAAA GGGTGAAGAT GCTTTTGTGA AAGGTTACCC TTATTACATT AAAAAAGAAA 7 80 

AAATAACACT AAAAGAACTG GATTATAAGT TGAGAAAGCA TCTAATTGAA AAATACGGAC 84 0 
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ATAACCTTGA TTTAAGATCT AAATTAAAAT TTAAATATAT GGGGGAAGTC ATAGAAAGCA 960 

AACAAATTAA AGATATTGAA GTTAACTTAA AGTAAATCAT TACGAATAAT TAAAAGTAAT 1020 


TGAAGCGGCT TAACGGTGAA ATGTAAATTG GTGCGCATAG CTTATACAAA AAGGATGCAT 10 80 

CAATCGATAT CGTCGTTAAG CCGTTTTGGT TTGTGTGTCA TGAATCCTAT CCCAATCTCC 114 0 

ATAAAGGTAA AATTTCCACC ACCAACATCA AAATTCTCCA CATCGCAACA TAACCAAATG 1200 


TTATAATAAA TCTATTACAC AAAGAGATAA ATTACTTATT CAAAGGCGGA GGAATCACAT 1260 

GTCTATTACT GAAAAACAAC GTCAGCAACA AGCTGAATTA CATAAAAAAT TATGGTCGAT 132 0 

TGCGAATGAT TTAAGAGGGA ATATGGATGC GAGTGAATTC CGTAATTACA TTTTAGGCTT 1380

GATTTTCTAT CGCTTCTTAT CTGAAAAAGC GGAACAAGAA TATGCAGATG CCTTGTCAGG 144 0 

TGAAGACATC ACGTATCAAG AAGCATGGGC AGACGAAGAA TACCGTGAAG ACTTAAAAGC 1500 

AGAATTAATT GACCAAGTCG GTTACTTCAT TGAGCCAGAA GATTTATTCA GTGCGATGAT 1560

TCGTGAAATT GAAACGCAAG ATTTCGATAT CGAACACCTG GCGACGGCAA TTCGTAAAGT 1620 

TGAAACATCA ACATTAGGTG AAGAAAGTGA AAATGACTTT ATCGGTCTGT TCAGCGATAT 168 0 


GGATTTGAGT TCAACGCGAC TAGGTAACAA TGTCAAAGAA CGTACTGCTT TAATCTCTAA 174 0 

AGTCATGGTT AATCTTGACG ACTTACCATT CGTTCACAGT GACATGGAAA TTGATATGTT 1800 

AGGTGATGCA TATGAATTCC TAATTGGGCG CTTTGCGGCG ACAgCGGGTA AAAAAGCAGG 1860

CGAGTTCTAT ACACCACAAC AAGTATCTAA GATACTGGCG AAGATTGTCA CAGACGGTAA 192 0 

AGATAAATTA CGTCACGTGT ATGACCCAAC ATGTGGTTCA GGTTCACTGT TGTTACGTGT 1980 

TGGTAAAGAA ACACAAGTGT ATCGTTATTT CGGTCAAGAA CGTAACAATA CTACATACAA 2040

CTTAGCACGC ATGAATATGT TATTACATGA TGTGCGTTAT GAGAACTTCG ATATCCGTAA 2100 

TGATGACACA TTGGAAAACC CAGCCTTTTT AGGCAATACA TTTGATGCGG TTATTGCGAA 216 0 


CCCACCGTAT AGTGCGAAAT GGACTGCAGA TTCAAAGTTT GAAAATGACG AACGATTCAG 222 0 

TGGTTACGGC AAACTTGCGC CTAAGTCTAA AGCAGACTTT GCCTTTATTC AACACATGGT 22 80 

ACATTACCTA GACGATGAAG GTACCATGGC CGTTGTACTC CCACATGGTG TATTATTCCG 234 0 


AGGTGCTGCA GAAGGTGTCA TTCGTCGTTA TTTAATTGAA GAAAAGAACT ACTTAGAAGC 240 0 

TGTGATTGGT TTGCCAGCGA ATATTTTCTA TGGGACAAGT ATTCCAACAT GTATTTTAGT 24 60 

ATTTAAAAAA TGTCGCCAAC AAGACGACAA CGTACTATTT ATCGATGCAT CCAATGATTT 2520

TGAAAAAGGA AAAAATCAAA ATCATTTAAG CGATGCCCAA GTCGAACGTA TTATAGACAC 2 580 

ATATAAGCGT AAGGAAACAA TTGATAAATA TAG CT ACAG C GCGACACTAC AAGAGATTGC 264 0 
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GATTGATTTA GATCAAGTCC AACAAGATTT GAAAAATATC GATAAAGAAA TCGCAGAAAT 276 0 

TGAGCAAGAA ATCAATG CAT AC CTG AAAG A ACTTGGGGTG TTGAAAGATG AGTAATACAC 2 82 0 

AAAAGAAAAA TGTGCCAGAA TTGAGGTTCC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 2 8 80 

AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGC 2 94 0 

CTTTAACAAT ATCCGGACAG TTAGGTTTAA TTGATCAAAC AGAATATTTT AGTAAATCAG 3 000 

TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGG AGAATT C GCGTATAACA 3 06 0 

AAAGTTATTC TAATGGATAC CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 3120 

GTGTATTGTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAG AC IT CA 3180

TGGAAGCATA TTTTGATTCG ACACACTGGT ATAGAGAAGT TTCTGGAATT GCAGTTGAGG 3 24 0 

GTGCAAGAAA TCACGGATTA TTAAATGTTT CTGTGAATGA TTTTTTTACT ATTCTAATTA 3 300 

AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 3360

AAATTGAATT AGAAGAACAA AAGCTTGAAT TACIT CAACA ACAGAAAAAA GGCTATATGC 342 0 

AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAAGAT TATCCAGATT 34 80 


GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 3 54 0 

AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 360 0 

ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTG CAT ATA 3660 


ATTCTATGAG AATGTGG CAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 372 0 

CTGCATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 3780 

TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 3840

CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 3 900 

AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 3960 

TGAAAATTGA AATATTAGAA AAAGAGAAAC AATCCTTTTT ACAAAAAATG TTCTTATAAC 4020

TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT AAAAGTTGTA 4080 

AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 4140 


TTTGTGTCGA AATTGTGTAC AGAATAAGTA GTTAAATAAA GATTAAGTTG AG AT AAAG TG 4200 

TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 4 260 

TTAAATTACT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 4320

TTGCTCATTA TAATGA 4336

(2) INFORMATION FOR SEQ ID NO: 547: 
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(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 

TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTTCAGGTT TAATTTCAAA 60

GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 12 0 

AT AA CAT AAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 180 

AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGGaAT TCGTTATGAT CAACTAAGAT 240

AACATCTTGA CCATCTAAAT CATCTGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC 3 00 

TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 3 60 

ACCTCGAAGT TGTTCaAATT CTGCCaTAAT AATCGCAGAT GAAATTGCAT CAGTGkCTGG 4 20 

aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 4 80 

TTCTTTA 487

(2) INFORMATION FOR SEQ ID NO: 548:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 871 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548:

TTGGTGGTGC AG CAGTTATA GCAATTGTTT TAGCATTCAT TGGTAAGTTC ACTGCATTAA 6 0 

TTTCTTCTAT ACCTACmCCA GTTATGGGAG GAGTATCTAT ATTACTTTTC GGTATTATTG 120 


CAGCAAGTGG CTTAAGAATG TTAGTTGAAA GCAAAGTAGA TTTTGCGAAC AATCGAAATT 180 

TAGTTATAGC TTCTGTAATT TTAGTTGTAG GTATCGGTAA TTTAGTATTT AACTTAAAAG 24 0 

AAATTGGTAT CAACCTTCAA ATTGAGGGGA TGGCATTAGC TGCACTTTCA GGAATTATTT 300

TGAACTTAAT CTTACCTAAA GAGAAAAAAC AAAACAATTA AGATTTACAA ATTAAGGAGG 360 

GCGCTTTTAT GAATCATTTA TTATCAATGG AACATTTATC TACAGATCAA ATATACAAAC 4 20 

TTATCCAAAA GGCAAGTCAA TTTAAATCTG GTGAACGTCA ACTACCAAAC TTTGAAGGGA 480

AaTATGTCGC AAATTTATTC TTTGAAAATT CTACTCGrAC AAAATGTAGT TTTGAAATGG 54 0 

CAGAACTTAA GCTAGGGTTA AAAACGATTA GCTTTGAAAC ATCAACATCA TCTGTTTCAA 60 0 


TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTG 720

CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 780

TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 840

ATTCACGTGT CGCACtAGTA ATTACCAnAG T 871



(2) INFORMATION FOR SEQ ID NO: 549:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549:

TCAATCTAAT ATATACTTCA TGACTTCCCG CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 60 

TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 120 

ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 180 

CCATTGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT 24 0 

TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 3 00 

CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTCGTCC TTGTTCCTCA aAAwAGCTTT 3 60 

CAATATATTG AAAGAAGGCT CATCAATAAC TAAATCTnTA 400

(2) INFORMATION FOR SEQ ID NO: 550:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 

ATACCTCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 60 

ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 120 

CTCTAAATGT CTGTGTATTA CTTAAAATTA TACATATTTT GCTTATATTT TAGCATATTT 18 0 

TGTTTAAACC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 24 0 

AAACATTTCG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 3 00 

TAAGTTATCA CTTAAAAATC GTTTGGCaAA TTCGTTGTGA CGCTTGTCCA TCTTCTAATG 36 0 
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TTTGTTTTAA TGCATCAATG AGTGCTGTTT GATTTTCAAC AATTGGACCT GGCAACTCTT 4 80 

TTTTATAATC CATGTAAAAA CCTCTAAGCT CATCGCCATA TTTATCTAAG TCATATGCAT 540 


AGAAAATTTG CGGACGCTTT AATACACCGA AGTCGAACAT GACAGATGAG TAGTCGGTAA 6 00 

CTAACGCATC GCTGATTAAG TATAAATCCG AAATGTCTTC ATAAtCTGAA ACGTCTTTCA 660 

CAAAATCATC ATGTTCATCA ATACGTGTCA CAACTAAATA ATGCATGCGT AAkAAAATAA 720 


CAT ArT CATC ATCCAGCGCT TGACGCAAAG CTTCTATATC AAAGTTAACA TTAAATTGAT 7 80 

ATGAACCTTC TCGAATCGCT TCATCGTCAC GCCAAGTTGG CGCGTACATA ATCACTTTTT 840

TATCTAATGG AATATTTAAT CTTGTCTTAA TACCATTAAT ATATTCAGTA TCATTGCGTT 900

TATGTGATAA TTTATCATTT CTTGGATAAC CTGTTTCCAA AATCTTATCT CGACTAACAT 960 

GAAATGCATT TTGAAATATC GATGTCGAAT ATGGATTAGG TGACACTAGA TAATCCCACC 1020 

GTTGGCTTTC TTTTTTAAAG CCATCTTGGT AATTTTGAGT ATTTGTTCCT AGCATTTTAA 1080

CGTTACTAAT ATCCAAACCA ATCTTTTTTA ATGGCGTGCC ATGCCATGTT TGTAAGTACG 1140 

TCGTTCGCGG TGATTTATAT AACCAATCTG GTGTACGTGT GTTAATCATC CwCGCTTTCG 12 00 


CTCTTGGCAT CGCTAAAAAC CATTTCATTG AAAACTTTGT AACATATGGT ACATTGTGCT 12 6 0 

GTTGGAATAT GTGTTCATAT CCTTTTTTCA CACCCCATAT TAATTGGGCA TCGCTATGTT 13 20 

CAGTTAAGTA TTCATATAAT GCTTTGGGGT TGTCGCTGTA TTGTTTACCA TGAAAGCTTT 13 80 


CAAAATAAAT TAGATTCTTG TTTGGCAATT TTGATAGTAA TTTAAAAGTC GTATATATAC 144 0 

TATGTTCTAT CAATTTTTTA ATTGTATTTT TAATCATGTC GTACCTCCGA CGTGTTTTTG 1500 

TAATTATATT AATATGTATG AGC 1523

(2) INFORMATION FOR SEQ ID NO: 551:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4923 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551:

CAGTAAGAGA TTTTCTTAAT TGAAAATAAT CTTACTGCTT TTTTAAATTT AATTTCGAGA 60 

TTCAATATTA GTTTATCTCA TTGTGGCATT AATTGATAAA ATTGTTTTAA TGTTATAAAT 120

CAAAGTCtTC TTCAACAATT TCAATGTCTG CATCAGATCT ATGATATGTG AAAAAGCTAA 180 

TTCTTATGCG GTCTAAATGC TCTAAATGGT GTCGATACTC TTCGATGGCT GCAACAATTT 24 0 
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ATGTGGACAA TAAATCTTTT TTCTGCGGAT TATCTAACTC ATAATCAACA TGTGTCACAT 3 60 

TATAACGTGC TTTTTTAGAA AGGCTAGCTA AAATTTGCTC GTGAAAAGCA GTTAATGAAT 4 20 

CTAAATCTAA TTTGATTTGT AATAGGAAAT TGTTATTAAG TAAATATAAG TCGTTTTGAT 480

AACGCGACAA TTTGTTTAAT ACTTCATAAG CTTGTCTAGT CGTCTGAACT ACTTCTCTGA 54 0 

AAAGTATTTT CTTTCTATTC TGTTGGTGAA TATGTTTTTT TGTAATAGGA CGTTCTTCGC 600 


TATAGTAATC ATAAATTTTC TCTAACTTTT CGACACGTTG TTTTAAATTA TGACTATCTT 660 

GTTTAATATT ATTAAACTCC GTCGTATCAT TTAATACTAA TTTAAACCAC ATAAAAATAT 720 

CTGAGGATAT ATTTAATGAA TTATAGTAAA TTTTTGTTTC AAATTTAGGT GGTAGAAACA 780

CAAAGTTAAC TAGAGATGAA CTTATGACAC CAATCATTAC AAGTACAAAC CTGTAAAAGG 840 

CGGTAATATA GAAAGAACCG GTATGTTGTC CCATAATGAT TAATGCTGTT ACACTCGCCA 900 

AAGTAGCAAC ATGTGCTAAA TTAAATTTAA ATAAAATAGC AATAAGTACT ATGACGGTAA 960

CACCCATAAT GATAAAATTA TCACTAAAAA TTGTTACCAT TGTAACAGAT AGTATGGCAC 102 0 

CTATAATGTT ACCCAATGCT TGATCAGAAA CTGTTTTAAA TGAACGATAA ATACTAGGTT 10 8 0 

GCATTGCACA ACAGCACTGA CACCAGCTAA GGCTTTCAGA CCAACATCAT CCGGTAGTAA 1140

AGAAGCGATA GACATAGCTA AAATAATGGC TATACCAGTT TTAAAAATCC GAGCTCCTAG 1200 

TCTCAAAAAT AATGACGCCC CTTTTAAGTT TATTGAATAT CTAATATTCG TATTCATTAC 12 6 0 


TGTTATACAC TTACTAGTTA CAAAATTCAA GCTTATTTAT AGTTGTTAAA ATAAATCATA 13 2 0 

CATAATACTG ATAGCGATGT AAAACTTTAG TCAGAGATTA AAATAGTATA AATTTGTAAA 13 80 

ATAAAAACTC ACATAGTGAC ATATCAAGTT AAACGTTAAT AGTTAACGAT ATAAAATGAA 144 0 


TCTACTATGT GAGCATTTGC TTTATTTTAA TTCAATTAAA AATATACTTC CTTAAAAGTT 150 0 

ATTTCATTTG ACTAAAAGCA TAGTCTGCAG CTTTTAAAGT TTGTTTAATA TCTTCTTCTG 1560 

TATGTTCAGT TGTTAAGAAC CAAGCTTCAA ACTTAGAAGG TGCTAAATTG ATACCTTGAT 1620

TTAACATTAA TTTGAAAAAT TTACCGAACG CTTCGCCGTC AGAATGTTCA ACTTGATCAT 16 80 

AATGTGTGAC TTTTTCATCT GTAAAGTACA ATGTTAAAGA TCCATAAATA CGATTAATTG 174 0 

TAGCTGTGAT ATTATGTTTT TCGATTAATT TAAGTAAACC TTCTTCTAGT TGTTGGCCTA 1800

AGCTGTCTAA TTTTTCATAA ACACCGTCTT GTTCTAGTAC TTCGAGTAAT GCAATACCTG 186 0 

CTTTCATAGA TAACGGGTTA CCAGCCATTG TACCAGCTTG ATATGCAGGT CCTAGAGGTG 1920 


CTACTTGTTC CATAATATCT TGACGTCCAC CATAGCCTCC AATTGGTAAA CCACCGCCAA 1980 

CAATTTTACC AAATG CAGTT AAATCAGGGA TAACACCTAA TAAATCTTGA GCGGCACCGT 204 0 
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AAATTTCATT AACCTCTTCT AAAAATCCAG GTTGAGGCAT TACCATTCCA AAGTTACCAA 2160 

CAATTGGTTC TACTAATACT GCGG CAATTT CATCACCCCA AAATTCAATT GCTTCTTTAT 222 0 


AGGCGTTAAT ATCATTGAAA GGTACAGTAA TGACTTCACG TGCGACGCTT TCTGGAACAC 2280

CAGCTGAGTC TGGAGAACCG AGCTGAGATG GGCCGCTACC TGCTGCAACC AATACTAAAT 234 0 

CAGAATGGCC ATGATAAGAT CCAGCAAATT TTATAATTTT ATTTCTTTTA GTATATGCAC 24 00 


GTGCAACACG AATTGTTGTC ATGACTGCTT CTGTTCCAGA ATTTACAAAG CGAATTTTCT 2460 

CAAGAGATGG AATTGCATCA CGTAATTTTT TGCTGAATTC AATTTCTAAT TCAGTCGGTG 2520 

TACCAAATAA AACACCTTTA GCAGCTTGTT CTTGAATTGC TTTAGTAATA TGAGGATGTG 2580

CATGCCCCGT AATAATTGGA CCGTATGCTT GAAGGTAATC AATAAATTTA TTGCCATCGA 2 64 0 

CATCATATAA ATATGCACCG TGTCCTTCTT TCATAACAAC AGGTGCACCG CCTCCTACAG 27 0 0 

CTTTATAAGA ACGAGAAGGG GAATTGACAC CGCCTAGAAT ATATTCGTTT GAAAGTTGTT 2760

GTAAACGTTC ACTTTCACTA AAATTCATTT ATATCAACCT CTTTTAATTT AATATTTTCA 2 82 0 

TCTAATATCG TATCATAAAA TTATTATAAT GAAGAAAAAG GTGATTATAT GTTGCAAAAA 2 88 0 


GGAGAACAAT TTCCAATATT TAAATTAGAA AATCAAGACG GAACTGTCAT TACAAATGAT 2 94 0 

ACATTAAAAG GTAAAAAGGC GATTATATAT TTTTATCCTA GAGATAATAC ACCTACTTGT 3 000 

ACCACAGAAG CTTGTGACTT TAGAGACAAT TTAGAAATGT TCAATGATTT AGATGTTGCA 306 0 


GTATATGGTA TAAGCGGTGA TTCAAAGAAA AAACACCAAA ATTTTATTGA GAAACACGGA 312 0 

TTGAATTTCG ATTTATTAGT AGATGAAGAT TTTAAATTAG CTAAAGAAAt GGCGTATATC 318 0 

AGTTAAAAAA ATCATTTGGC AAAGAAAGTA TGGGCATTGT AAGAACGACT TTTATAATAG 3240

ATGAACAAGG TAAAGTATTA GATGTTATCG AGAAGGTTAA GGTTAAAACA CAAATAGAAG 3300 

AACTTAAAAA CATTTTGGGG TGACATATAT GAAAGTTGTT GGGTTAAATC GTATGCGTGA 3 360 

AGTTGAAACT GAATTACAAC AACGCTTTTC AGATTTAGAT TTTAAATTTT ATAAAAAAGC 3420

ATCAGAAATA CCTGAGAGCG ACTTGGCTGA TTTAGATATA TTAGTTGGTT ATGATGGCGG 34 8 0 

TATCAATGAG GCATTTTTAC GACGTTGCCC GAATTTAAAA TGGATTGCAT GGTTTGCAAC 354 0 


GGGTG TAAAT ACATTGCCGT TAGATTATAT TGCAGATCAC GGCATACTTT TAACTAATGG 3600 

AAAAGGTGTT CAAGCTAAAC AATTATCTGA ATACATTTTA GCTTTCATTT TAGATGATTA 3660 

TAAAAAGATG AAACTATCAT ATGATAACCA ACGACAACAT ATATATGATT CGAAAATAAC 3 72 0 


TGGTAAACGC CTATCAGGAC AAACAGTTTT ATTTTTAGGT ACAGGTGCAA TTGCTACTAG 3 780 

AACTGCGAAG TTAGCAAAGG CTTTTAATAT GAATTTAATT GGTCTGAGCA AGTCAGGTCA 3 84 0 
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w 



TGCTGACATT ATTATAAATG CTTTACCAGA AACGCAAGAA ACGATTCaTT TaCTAAAGAA 3 96 0 

AAAACATTTT GAATTAATGA AAGATGAAGC ACTTTTTATA AATATAGGAC GAGGTAGCAT 4 02 0 

AGTTAAAGAA GCGCTCTTAA TAGAAGTATT AAAAAGTAAA GTTATTCGAC ATGCATATTT 4 08 0 

AGATGTGTTT GAAAATGAAC CTTTGAAACC TAATCATGAA TTATATGAAT TGGATAATGT 414 0 

AACTATAACA GCGCATATAA CTGGTAATGA TTATGAAGCA AAGTATGACT TATTAGATAT 4 2 00 

TTTTAAAAAC AATCTAGTTA ATTTTCTCAA TAAGAATGGT CTAATTGAGA ATGAAGTTG a 4 26 0 

TGCTAAAAAA GGCTATTAAA TGArATCATC ATGTAAATAT TGACACGCGC GCAATACTAC 4320 

AGTTATATTT aTAGTAAgTt AATaATgATT ATATAAGAaA GATGGTgATA TAGATGAGTG 4380

TTGAAATAGA ATCAATTGAA CATGAACTAG AAGAATCAAT TGCATCATTG CGACAAGcAG 444 0 

GCGTAAGAAT TACACCTCAA AGACAAGCAA TATTACGTTa TTTaATTTCT TCACATACTs 4500

ATCCAACAsC TGaTGaAATT TATCAAGCAC TTTCACCTGa TTTTCCAAAT ATAAGTGTTG 4560

CGACAATATA TAATAACTTA AGAGTGTTTA AAGATATTGG AATTGTAAAA GAATTAACAT 4620

ATGGAGACTC ATCAAGTCGA TTCGACTTTa ATACACATAA TCATTATCAT ATTATATGTG 4680

AACAATGTGG TAAGATTGTT GATTTTCAAT ATCCACAGTT AAATGAAATT GAAAGATTAG 4740

CTCAGCATAT GACTGACTTT GACGTAACAC ATCATCGAAT GGAAATTTAT GGAGTTTGTA 4800

AAGAATGCCA AGATAAATAA TTTAACTTTG GTAGTATGAC AAATTAAAAA AGCGTTACTw 4860 

ACTTCATATA AGTAAGCGTA ATATTTAAGA nGTTAAACGA CATGaAAGTt GTTTAACTTT 4920

TTT 4923 
(2) INFORMATION FOR SEQ ID NO: 552:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

TCCGGCTTTA AAAACTTTTC CCAATTCCAG CTTGGGCCTT TGGCATTAAT ATTAACCTCC 60

TGGTTCGGAT TAATTGGGAA CCTAACCnTT TAGGCAATAA TTGGTTTAGG CAACTTCCAA 120

ATTGGTGGTT CAACCAACGT CTTTGGATAC CyTGcTCATT TAATTCTAAA ATGgTyrGAA 180 

CGCATTTTGG TACCCAAAAt GgTGACGTTC GTTTGCACGG TCTAATAAAT TGTCTAAGTT 240

GTCGATTGGT TTCATTAATT GAaCACCATT TTGCTCTTCA GCAAGACCTT TGTCTACTTT 300
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GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTG AGAATGAAGG 4 20 

TGAAGTTACC ACACGTGTAC GCATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 480

ATATTGATCT TCGTTTACAC CATATTCTTT AAGTGCTTTT GGTGTACTAC CACCACTTTG 540

GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 600 

ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 660 

AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 720 

TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 780 

ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTGCGC AATAAATACT 840

AAAAAGGGAT ACAATGCTAT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC 900 

CTTTTAGAAG TATGAAA 917 
(2) INFORMATION FOR SEQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553: 

ACTCCCATAT CCACTGGCAA GATTTATAAT TTGTAATTCT AAAGTATAAT AGCCTATATT 60

TTTCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 120 

AATGTATTAA ATATTTCCAT TTATACATAT ACATCTTGTA TAAAAAGAAA GAACTCCATA 180 

TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATAA AATTATTATA CTATTTTGAA 240

AAACATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTATA 300 

AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTTATTGTAT GCCTATTGaA CTCAATGCgT 360

ATATTACAAT ACCTTTTtCG CATATTCATA TAAGacTTTG CATCTTTAAG CTTAATTGCT 420

ATCTCTTTCT CT 432 

(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



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TTTTGCTTTA TGTXAAATGG ATTATTaTTA GAATAAAAAA TCGGTGATGA GCTAAAAAAG 6 0 

TGTGTAGGAT GTTTTCCkAA CCCAATTTTT ACATCCGAAG ATATCGAACA ATATCTTCCT 120 
ACTTCTACAT TATTAAAATC ACTACCAAAT CCAATATAAC TGTATTCACC AATGTGAGAA 180

TTCCTGATTT TACACCATCT ATCTATATAG TTATTGCCAT CAAATTTTGA GTTTGTAATA 240

TACGCCAAGC GATGAATCTT AACATTCGAT TCTTTAGAGG ACTGGTTTTT CAGCAAACCA 300

ATTATCTTTT CAATCGCTAT CCTCATCGTC ATTTCCTCCA AGTATTTCGA TTGATAATAT 360

CTTTATAGCT TTGAATAATT TTAACTACCT TTGTCGAAAC GTTAGTGTCT TTATAATCAA 420

TAGCATCAAT CATCGGTTCG TTATTGTTTT GCATCTCTCT TGCTAGTTCA ACGGATTGGA 480

TTAGATTGTT ATAGGTAATA CCACCTACAA TAACCGTACC TTTATCTAGT ACTTCCGGTC 540

TTTCTGTGGA AGTTCGAATA AGGACACCAG GGAACTTCAA AATAGACGAC TCTTCTGACA 600 

ATGTTCCACT ATCTGATAGC ACAACAAATG CATCTTTTTG CAATGCATTA TAATCAAAGA 660

AACCAAATGG CTTTAACTGT TTAACTAATG GATCAAATTC AAATTTACTT TCTTCAATTT 720 
TCTTCCAACT TCTTGGATGC GTTGAATAAA TCACAGGCAT TTTATACTTT TTGGCAATAT 780

CATTTATCGC ATTCATTAAT GATTTAAAAT TCTTTTCATT ATCGATATTC TCTTCTCTAT 840

GCGCAGaTAC TAAAATGTAT TGTTGCGGTT CTAATCCTAG TTTATTTAAA ACGTCACTGT 900 
GATTAATTTT ATCTCGATGC GCTTCTATCA CTTCTGTCAT CGGTGATCCT GTyACAAAGA 960

TATTCGCTTT ATTGAAGCCT TCATCTAATA AATAACGTCT GCTATGTTCC GTATAAGGTA 1020
GATTCACATC ACTGACATGG TCAACAATTT TACGATTGAT TTCTTCAGGT ACATTCTGAT 1080 

CAAAGCATCT ATTACCCGCT TCCATGTGGA ACACAGGAAT CTTTAATCGT TTAGCAGATA 1140

CTGCTGCTAA ACAACTATTT GTATCACCAA GAATTAAAAG TGCATCTGGT TGTTCGCGTA 1200 
ATAAAACATC ATATGTCTTC GCAATAATAT TCCCCAtCGT TTCTCCAnGt TACTTCCAAC 1260

TGCCTCTAAG TAGTGGTCCG GTTGTCTTAA TTCCAAATCA TCAAAGAAAA TTTGATTCAA 1320
TGTATAATCA TAATTTTGAC CAGTGTGTAC TAATATCTGA TTAnAAATAT TGAT 1374 
(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555:

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TTACCTCTAG GCATGTCCCT TTCACGGTTT GCTTATGATA ACGTTATCGA CATTGTTAAA 12 0 

TTGTCCTTTT TGAAAAAATA ATTCTGGTGA GATAGACGCT ACTTGGaTTT CaTCaGTTTG 180 

TAAtAACGCA GtATAAtTAC CaTTACTAAA tTGaGTTAAT CGTTCaTATA AAGTACTAAT 240

AGGATAATAA ATGTTATCCG TTAAgCGCGC CGTATAGTTC ACTTGATACG TTTCGCCTTC 300 

AACAATTGCT TGTTGGACAC GTTTAATATT AGTCATCATA ACCTCAGAAG ATTCAACAAA 360 

TGAAAAATGA TACTTTGATA CATAAGAAGT TTGATGTTCA TATGTTGAAT TTATGCTTTC 420

CGCTTTTTCA AAACTATAAG CTGCTGCATA AATATCATCT TTAGCTAATG AATGTGTACA 480

CATAGCATGA TTAAAATACT TTGCCGCTTC GTAACTTAAA TATAACGAGA CATATCTACC 540

TTGTCGTkGt GCGCTTGTGC AAAGTGTATC ACTTCTCCCA CATCAGCCAA CTTAGTAGCA 600 

ACATACTTCT TTATAAATCC CTTTAATTGA ATATGGTATT GCTTATATTC ATTTTCAGTT 660 

AAATAGTAGC GATAATTATA TTCTATTCTC ACAGTAATCA CCTACCTTCG ATAAAAATAA 720

TTCAACTTGT CGATAACCGT ATTCACTCAA AATAGATTCA GGATGATATT GCACACCAAA 780

AACCGGAAAT CTAATATGCT CAAATGCCAT AATAATCGCT TCATCGTTTT TTGCTGTAAT 840

CTTTAAGCAA TTTGGAAAAG TCGCTCCGTC AGCAATTAAT GAATGATAAC GCATTACATT 900

GAAATTTTGA GGCAGTCCTT GAAAAATACC TTCATTGGTA TGGCGTAACT GTGTAGTATG 960

TCCGTGTACA GGATGATAGC CGTGAATGAT ATTTCCACCA AAATAAGACA CGATACATTG 1020

AAATCCTAAA CATACACCTA GTATAGGTAC ACGCTGATAA AATTGTTCTA ACACTTCATT 1080

CAAGATAGGA TAATCATCCG GaTTACCCGG CCCAGGCGAA ATAACAATTG CTTTTGGCTT 1140

CATATTAATG ACGTCTTCTA TCAGCAGATT ATCAATACCA ACAACTTGAA CTGTTAGTTT 1200

CGTTTGAGTC TTAATATAGT CTATTAAATT ATATGTAAAT GaATCATTAT TATCTATGAC 1260

TAGAATCATT GTATACTCCG TTCTAAATGT GTTTTATTTT TATAATATGT ATTGGATGTA 1320 

GCTAAAACTT TAAAAGCATT GTCATTATCC TGACCTTGAT TTTAACTAAT ATATGGTATA 1380

TTCTATTCAT CGTACATAAA TGAATATCAG AGGTTCCTAG CTGAAACCCT CTATAAAAAA 1440

CTAGGCCATT GAAATTTCAA ACATTCGTTG GG 1472

(2) INFORMATION FOR SEQ ID NO: 556:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

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AGAACAGCAA GGATTACTTA CTGAGGAATT AAAGAAGGAT ATTTTAAAAC AGAACAAATT 60 

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAG AAAAAGACAA GGGCAACTGA 120 

GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGGCACGTA AACATGAAGT 180

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 240

TATCAAAGGT GCACAAGATA TTATTGCGGA ACAAATTTCA GATAATCCTA AATATAGAAC 300

AAAAATTTTA AAAGATATGT ATCATCAAGG TGTGTTAACT ACATCTAAAA AGAAAAATGC 360

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 420

TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 480

GTTTGAATTC GATACGACAT CAGTAGAGGA TTTCATTGCA CGTCAAGAAA TCAATCATAA 540 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATGCAAT 660

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTGCAA CCTCCAATGA AAGGTAAACA 720 

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCCATT 780

CGGTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 840 

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AGCGTATGAT GTGCAATTAA TTGCAATTGG 900 

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 960 

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCGCT TCAGTATACT CAGCATCAGA 1020 

AATTGCTAGA GATGAATTTC CTGATTTTCA AGTG 1054 
(2) INFORMATION FOR SEQ ID NO: 557:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557:

AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 60 

AAATTTCTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 120

GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 180 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 240

TAAAAATCAT ATGGTGAATG GATATTATAA TAATTAAAAT ACAAAAATAG TAGATTCCAA 300
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AAGGGAAAAT AAGTGTTAAG TTTTAAATGA TAAAAAAGAT TGGAATGGAT CGTCTTGAAA 4 20 

TGCTCCCTTC AAAGTTTTCA TTTTTTCAAT GTCGACTTCG AAGGGGGCAT TTTCATTAAA 480

TTGTTATAGC TTTTTATATT TGTATAATGA ACATATAAGT TTAAGAAGGT GCGAGTGAAG 540

GAAATAAAAA AGCTCAAATG TACCAAATTG TTAATCTTAA TAAATCTCTA CTTTATAAAG 600 
ATTGAATGCA CATTCGAGCG TTAATCAGTC AGGAGGGACT TTCCCTCCTA CAATTTAATA 660 

ATAATACTTG CTTCACCACT ATACAAGGAG TGAGTTGTTA TGTTCAAAGT GAATTATTCG 720

ATTTTAAGTT ATTATCCAGA ATATAATATC GCAGTAAGTT GGCAACGTTT AAGAGAAGGA 780 

AAAACAATAA AAAACAAGAT TTAATACTGC TGCGTCATGA GGCGCTTGAA CATTATTTGA 840

TGAATAAGTA TAATTTCAAC TATGATTATG CACATAAAAT TGTATCAAAA AAATACGATT 900

ATTCAATTTT TATAAAAaAG AAGGTGGATT AAATGCTTAC ATTAATAAAA TTGGAAAGAA 960

GATGaACAGG tTATAATATA TGraTATATa CCTGaAGATG aTATAAGTAC CGGGTAAAGG 1020

GTCCCGTACC TTTTTAATTA AAAAAGTTCC AGGGGGT 1057

(2) INFORMATION FOR SEQ ID NO: 558:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 

CTGTGCTGTA TTTACtTTAA TTTGACGACA TTTGAGAAGT ATTATGATGG nTGTAGTTAT 60

TTTATGAAAG TAATGTATTA ACAATCGAAT TACTAAGTCT ATATTGGTAG GAATATCGAT 120

GTTTAGTTTA AATGGAATGC ACTATTTAAG TTTTTAATAT GGAGATGTTT GTGACTTTTC 180 

GATGATTAAG ATTTTTATAG GTGTGCATCA TTTCAAATAA ACTTTGTGTT TAAAATTGAG 240

CTTAGGAAAT CGATAGGTTT AGATGAGGAT ATTGTTGAAG TTATGTGTCT TGTATCCTTA 300

GTTGTTATAA AAGCGCAAAA AATAGCACCG CTTTCTCTTT ATCTGTGTAG AAAGGATGCT 360

ATTATTGTAA AACAATAGTT TTAATTTnAT TTTCTGATaT ATCATATGTr ATTcTACCTG 420 

TATCAATTTC TATCGAATTA TAACCATCAA AATTATCAAC TTTATCATCA AAGTCTATCA 480

CTTTTCCAAT TAATATTTTA TTATTAGTAA GCGTTAATTT GACTAATTTG CCTATGTATG 540

ACTGTAAATT CATATTTAAT CACTCCTTTT TAATATACGG AACTACATGA AAACCAGTTT 600

TAGAATAATG AACCTTACCT AATTTCGTTT CAATATATTT ACCATTCACA TAAGATTTAC 660 

AAACTGGAAT ATCAATAAAT TCTTTTTTAG TCATCTTTTC ACTTTCATTA GAATCTATTA 780 

TAGTATAACT TGGTAATAAA GATGAATTAT TTTTCTTATT ATTATATTCA TTTGTGCATA 840

AAATATGACG CTTTTGCTTA ACATTATTCA ATTTCATCGT CATTTTACCA TTACTTATCA 900 

TTTCAAGCAT TTCTTTTTTA GCTTTTTGTA GAAAGGATGC TATTTTGTAA TTATTTTAAT 960 

AATCTAATAC TTTTTATCTC AGTTTCATCA AACGAATAAA CAGCAAAATC AGTCTGTATA 1020 

TCTACAACAA GATTTCCTGT TTCGCTCTCG AATTCATTTT CATAGTCAGT TACAAACCCT 1080

TCGTATTCTT CATTATTCAA AAGCGTGATA ATAACATCTT TACGATATGC ATCTnCAATT 1140

CTCAATACTT TTCACCTACT TATCAATATA AGGTACTATA TGaGcACCTG GTCTTCGAAT 1200

AATGGCACTT cCCTTTCTGG TTTCAATATA CACATTTTCG ATATGTATTT TTCCAATAAT 1260 

TTGATTAAAA TTAATAATCT CTTTCAAATC AAATCGCTCA TCACTTAATA TTAGATTGCC 1320

TGTTGACATT TTTTCTCTTA ACAATTCATT CAATAAATCT ATAGAAAGTA TTGTATAGCT 1380

AGGCAATTTC TTATTATTTA AAATGGCTCT TTTTTTATTT TCATTATATA GGTGATGACC 1440

TAACATATAT CTATTTTATT TTCCGCAATT TATTTCTATT TTTATTTTAC CATTTTTAAT 1500 

CATTTCTTTC ATCTGGTTTT TAGCTTTTtC CTGtAATTAT GCTTCTTTTA CTTCTACTTG 1560 

ATATTTACCt TCACGCTCTT TAAAGAACTT GTCCCGCCAA TTGCCAACAT GTGGCACTGT 1620

GGTACTTCTA CACCAAGGAT GCATAGGTGG CGCATTCACA CCTGGTATCA TATCTTTAAC 1680 

TTTAAATATT TTTCCGTTAA GTGAATGACA TAATTTAGAT GTTTTACTAT CTATTTTGGC 1740

AACATATTTA TATTCGCCAT CTTCACCAAG TTCTTTTAAA TATGTTAACT TTTGTGCTTC 1800 

TGCATTTTCA GTAAATAGTT AAAAAAGCGT ATAAAAATAG CACCACTTTC TCTTTAkCTG 1860

TCTAAAAAGG ATGCTATTTA TCTTTTGAAT TTGAATTCTT TTTCGCTTTT TCTATACTTT 1920

CAAATTCTTC AACTAATTCT TTAAAAGATT CACTCAATTC TTTTGCAGTT ACATTTCCAT 1980 

CTAATTGTGA ATCTAACATA ATTAAAATCA TCTCACTTTA TATTTAATCa TATTTATACT 2040

ATAAAGTTTT TTCAATAATT TTTCAATATG GCTATCATTA ATGATATTAA TATGCGTAAA 2100 

ATATTTAGCA CAAAATTTAC TCACTATTTT ACCATGAAAC CTATTTGACT TGGTAATAAA 2160 

TTTTACTTGT CCCTTATTAG TAACGATTGT CATTGATTTT ATTGATGGAT GCTTAAAAAA 2220 

TGTAAATAAA TCATATTCTG AAAATCCTGA CTGTCCAGGA TGGTTATGTA ACATAACAAT 2280 

TGAATTCGGT TTACTGTTAA ATAATAATTC GGTTGCTTGT TACCCTGGCA CAAAAGATAC 2340

ACTATCTTGA TTGACATATA CTTTTGTAAA TTTACCATCT TTTAACAAAT AAGCTACTTC 2400

ATTGCTATCA TTGTTTTCCA TGAGATATAC CTCCGTTTAT AGtCTGTCGA CTGATATTCC 2460
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GAAGGTTAAA ATTTTTAAAG TAAACTTTCT TTATCCCCAT GCTACGAGTT CAGATTCAGG 25 8 0 

AAATAGCCCT TTACTAGTAT TTATGTATAT TCTGTCTATG GCATGAATAA AATAATTATC 2640

TCTTGTATTT tTTTCTAAAC TAGATTTTTC AGCATTGATA ACTTCAAGAC TATCTATATC 2700

CATTTGAATA ATACCAGGCT TAATATTTTC ATCATTATTA GGAAAATATT TATATGTAAC 2760 

ACTTTTATCA TTAATTTCTT TTATTTTTAA TATTAGCAAT CATTTCCACC TCTAATTAAT 2820

TAAAATACTA TAATTATATT TTATTTCTGT AAGTTTATGT GCCTCTATAT AGTGTAAATT 2880

ATACTTATTC ATTAGATAGT GTTCAAGAGC TTCATGTTTC TACATTATTA TATCCATTTT 2940

TTTAATATTT TTCCCTTCTC TTAAACGTTG CCAACTTTGA GCCATATAAA AGTCAGGATC 3000

AAATTGTTTA AATCCACTTT CTAATAAATA CTTATTTTCA AATATATGTT CATAAACTCT 3060

TTGAATTAAA TTTTTATTTA TATTAgTATT TTTAGCAATT TTAGAAATCT CTATCTGTTT 3120

ATCTCGATTT CTAACTGAAT TATAATAAAT TTGAGCATGT CTGTTCCTTT TGATACCGTA 3180

TTCATCACTT TTATTATTAA GTGCACCTGA TTCAATAAAA CAACCTTCTA CTTGATATTT 3240

ACCTTCACGC TCTTTAAAGA ACTTGTCTCG CCAATTGCCG ACATGTGGCA CTGTGGTACT 3300

TCTACACCAA GGATGCATAG GTGGCGCATT CACACCTGGT ATCATATCTT TAACTTTAAA 3360

TATTTTTCCG TTGAGTGAAT GACATAATTT AGATGTTTTA CTATCTATTT TGGCAACATA 3420

TTTATATTCG CCATCTTCCA CCAAGTTCyT TTAAATATGT TAACTTTTGT GACTCtTCyT 3480

TTTCAACGAA TAATGAAAAA AGCATATAAA AATAGCATCG CTTTCTCTTT ATCTGTGTAG 3540

AAAGGATGCT TTAATACCaT GCTATTTTAT AATTTTCGGG AAATTcTTGC TTCTCGATAA 3600 

AGTCTCTTAC TACAGAAAAA GACTTATTAC GATATAACAT AAAATATTCT TCATTTTCTA 3660 

TTTCAGATAA TGAGAAATCT ATAACCTCTG CATCTTTTTT ATTAAAAGTT ACTGAACCTT 3720 

TACCGTTACT TATATCATCT TCAGGTATAT ATTC 3754
(2) INFORMATION FOR SEQ ID NO: 559:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 

ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA TATTaGCATA GATATCGAAA 60
AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 120 
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CTGATTATAA AAATAATGAC TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 24 0 

CTGAAATGAA TATTACGGTA GAGCAGTTGG TAAACTTTGG ACGATTCCCT TATTCTAAAG 300

GTCGTTTGAC GAAAGAGGAT CATGATATTG TCAATGATGC GCTAGATTTG TTGCAACTAC 360

AAGATATCAG AAATCGTAAT ATTAAGTCAT TATCTGGTGG ACAACGTCAG CGTGCATACA 420

TTGCAATGAC AATAGCACAA GATACTGAAT ATATTTTGCT AGATGAACCA TTAAATAATT 480

TAGATATGAA GCATGCTGTT CAAATTATGC AAACGTTAAA AATGTTAGCG CATAAAATGA 540

ATAAAGCGAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAGA 600 

TTGTAGCATT GAAAAACGGA CAACTAGTTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 660

GTAGTGTTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 720

GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 780 

ATTAAAGAAG GGTTAAACGT AAAGCATTTA TTTAT 815

(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 919 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560:
GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 60 

CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 120

TTATGTTnAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 180 

TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAACG 240 

CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 300

AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 360 

TATTTTTATC CCATAGTGTG ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 420 

GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 480

GTGACGTACA TGATCCGTCT AGGTAAAATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 540 

GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 600 

CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACGTATT AGAGGAAAAT 660 

GACAAAATTT ATGGCTTTAT TGTTGTCGAC CAAGACCAAG CAGAATGGTA yGnyGACATT 720 
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TATAAAGGAG CTGCTACAGA ATTATTCAAT TATGTTATTG ATGTAGTTAA AGCACGTGGT 84 0 

GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TTTATTTGCC 900 

AAATTTGGAT TTCATAAGG 919

(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 518 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561: 

ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTGTT ACTGGACCTA AAAATTTAAA 60 

CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 120 

AGAAGCATGT TCATACTGCA AATCTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 180 

ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAGCTTGAT TAACAATTGC 240

TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCGGT CGATATCTTG 300 

AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 360

TTTCTTTTTT AAAATAGTTA ACCAAGATAG CCCAGCATGT TGTGaTTCTA ATGCTAAAAG 420 

TTTAAACAAT GCCTTGCnAT CATAGAGCGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 480

AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 518 
(2) INFORMATION FOR SEQ ID NO: 562: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 
CTTATTTAAA AGTAAATCAA TCAATGTATT ATAATCCGAA TAGTCCGCAT AAAGCTGGTT 60

TGCaGGCAAA tCAATTACTA CAACAAGCAA AAACCCAAAT TAATGCAATG rTTaATTCAA 120 
AAACAAATTA TGATGTTGTA TTCACTAGTG GTGCAcTGAA TCCAATAATC TTGCTTTAAA 180 

AGGTATTGCC TATCGTAAAT TTGATACAGC GAAGGAAATA ATTACATCCG TGTTAGAGCA 240

TCCGTCCGTA TTAGAGGTTG TAAGATATTT GGAAGCACAC GAAGGATTTA AAGTTAAATA 300

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